arXiv:1508.00634v2 [math.CO] 5 Aug 2015 


Excluding hooks and their complements 
Abstract

The celebrated Erdős-Hajnal conjecture states that for every undirected graph H there exists
$\varepsilon(H) > 0$ such that every n-vertex graph G that does not contain H as an induced subgraph contains a clique or an
independent set of size at least $n^{\varepsilon(H)}$. A weaker version of the conjecture states that the polynomial-size
clique/independent set phenomenon occurs if one excludes both H and its complement $H^c$. We show
that the weaker conjecture holds if H is any path with a pendant edge at its third vertex; thus we give
a new infinite family of graphs for which the conjecture holds.


1 Introduction 

The Erdős-Hajnal conjecture is a long-standing and intensely studied conjecture in Ramsey theory, bringing
together extremal, structural and probabilistic aspects of graph theory. Informally, it says that if a large graph
G does not contain a fixed graph H as an induced subgraph then G contains a large clique or independent set.

More formally, a set of vertices in a graph G is homogeneous if it induces a clique or independent set,
and we denote the size of the largest homogeneous set of G by hom(G). Given a graph H (resp. a family of graphs
$\mathcal{F} = \{H_1, H_2, \ldots\}$), a graph G is said to be H-free (resp. $\mathcal{F}$-free) if G does not contain H (resp. any member
of $\mathcal{F}$) as an induced subgraph. The famous Erdős-Hajnal conjecture [?] is the following.

Conjecture 1.1. For every graph H, there exists c(H) > 0 such that if G is an n-vertex H-free graph then
$\mathrm{hom}(G) \geq n^{c(H)}$.
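For very small graphs, hom(G) can be computed by exhaustive search, which may help when experimenting with the definitions above. The following Python sketch is only an illustration and not part of the paper's arguments; the function names `is_homogeneous` and `hom` are hypothetical, and the graph is assumed to have vertex set 0, ..., n-1.

```python
from itertools import combinations

def is_homogeneous(vertices, edges):
    """True if `vertices` induces a clique or an independent set."""
    pairs = list(combinations(sorted(vertices), 2))
    present = sum(1 for p in pairs if frozenset(p) in edges)
    return present == 0 or present == len(pairs)

def hom(n, edge_list):
    """Size of a largest homogeneous set of the graph on vertices 0..n-1."""
    edges = {frozenset(e) for e in edge_list}
    for size in range(n, 0, -1):
        if any(is_homogeneous(s, edges) for s in combinations(range(n), size)):
            return size
    return 0

# The five-cycle has no triangle and no independent set of size three.
c5 = [(i, (i + 1) % 5) for i in range(5)]
print(hom(5, c5))  # 2
```

The search is exponential in n, so it is only meant for sanity checks on tiny examples.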

The Erdős-Szekeres bounds on Ramsey numbers [20] imply that $\mathrm{hom}(G) \geq \frac{1}{2}\log_2 n$ for any n-vertex
graph G. Furthermore, the lower bounds on Ramsey numbers found by Erdős [18] using the probabilistic
method imply that for a uniformly random n-vertex graph G, $\mathrm{hom}(G) = O(\log n)$ with high probability. Thus
the Erdős-Hajnal conjecture suggests that H-free graphs are quite different from typical (random) graphs
with respect to homogeneous sets. The best known bound for the above conjecture is due to Erdős and
Hajnal [25]: they showed that the conjecture above holds if we replace $n^{c(H)}$ with $e^{c(H)\sqrt{\ln n}}$.

Despite much attention, the conjecture is known to hold for only a limited choice of H. We give some brief
background here and refer the interested reader to the excellent survey [?] by Chudnovsky on this fascinating
conjecture. The upper bounds on Ramsey numbers due to Erdős and Szekeres [20] immediately imply that
$K_k$, the clique on k vertices, satisfies the Erdős-Hajnal conjecture. In [2], Alon, Pach and Solymosi prove the
Erdős-Hajnal conjecture for a (non-trivial) infinite family of graphs by showing that if the conjecture is true
for two graphs $H_1$ and $H_2$, then it is also true for the graph formed by blowing up a vertex of $H_1$ and inducing
a copy of $H_2$ amongst the new vertices. It is known that the Erdős-Hajnal conjecture holds for all graphs
on at most four vertices, and the result in [2] then immediately implies that the Erdős-Hajnal conjecture
holds for all graphs on five vertices except the four-edge path, its complement, the cycle on five vertices, and
a graph commonly called the bull (a triangle with two disjoint pendant edges). Chudnovsky and Safra [15] settled
the conjecture for the bull, but the question remains unsolved for the other three graphs on five vertices.

Rather than considering H-free graphs, one can weaken the Erdős-Hajnal conjecture by considering
hereditary graph classes (i.e. graph classes that are closed under taking induced subgraphs). In this direction
the following conjecture was proposed, see e.g. [?], [?]. We denote by $H^c$ the complement of a graph H.

Conjecture 1.2. For every graph H, there exists a constant c(H) > 0 such that if G is an n-vertex
$\{H, H^c\}$-free graph then $\mathrm{hom}(G) \geq n^{c(H)}$.


Compared to the Erdős-Hajnal conjecture, this conjecture is known to hold for only a few additional
choices of H. Let $P_k$ be the path on k vertices. Chudnovsky and Seymour [?] proved Conjecture 1.2 when
H is $P_6$. Recently, Bousquet, Lagoutte and Thomassé [8] generalised this result and showed that, for every
k, the conjecture above holds when H is taken to be $P_k$. Our main contribution is to generalise this result
from k-paths to what we call k-hooks, giving a new infinite family of H for which Conjecture 1.2 holds.

A k-hook, denoted by $H_k$, is the graph on k + 4 vertices $\{v_1, \ldots, v_{k+4}\}$, where $\{v_1, \ldots, v_{k+3}\}$ form a
(k + 3)-vertex path, and $v_{k+1}v_{k+4}$ is a pendant edge. An illustration can be found in Figure 1a. We call this
graph a k-hook (rather than, say, a (k + 3)-hook), since it is more convenient to treat it like a k-vertex path
with a hook, i.e. a four-vertex path, attached to it.
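To make the labelling of a k-hook concrete, the following Python sketch lists its edges. It is an illustration only; the helper names `k_hook_edges` and `degrees` are hypothetical, and the integer i stands for the vertex $v_i$.

```python
def k_hook_edges(k):
    """Edge list of the k-hook H_k on vertices 1..k+4 (i stands for v_i)."""
    path = [(i, i + 1) for i in range(1, k + 3)]   # path v_1 ... v_{k+3}
    pendant = (k + 1, k + 4)                       # pendant edge v_{k+1} v_{k+4}
    return path + [pendant]

def degrees(edges):
    """Degree of every vertex that appears in the edge list."""
    deg = {}
    for u, v in edges:
        deg[u] = deg.get(u, 0) + 1
        deg[v] = deg.get(v, 0) + 1
    return deg

edges = k_hook_edges(3)
print(len(edges), degrees(edges)[4])  # 6 3
```

For k = 3 the graph has seven vertices and six edges, and the only vertex of degree three is $v_4 = v_{k+1}$, the third vertex from the end of the path.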


Theorem 1.3. For every k ≥ 1 there exists $c_k > 0$ such that if G is an n-vertex $\{H_k, H_k^c\}$-free graph then
$\mathrm{hom}(G) \geq n^{c_k}$.

We now describe some further results. A class $\mathcal{G}$ of graphs is said to have the (weak) Erdős-Hajnal property
if there exists a constant c > 0 such that every graph $G \in \mathcal{G}$ satisfies $\mathrm{hom}(G) \geq n^c$, where n is the number
of vertices in G. Clearly, Conjecture 1.1 is equivalent to the statement that, for every graph H, the class
of H-free graphs satisfies the Erdős-Hajnal property. Instead of asking for homogeneous sets, one can ask
for homogeneous pairs: for a graph G and two disjoint subsets of its vertices P and Q, we say that (P, Q) is
a homogeneous pair if all possible edges between P and Q are present or if all of them are absent. In
the former case, we call (P, Q) an adjacent pair and in the latter case an anti-adjacent pair. A graph class $\mathcal{G}$
has the strong Erdős-Hajnal property if there exists a constant δ > 0 such that every $G \in \mathcal{G}$ with at least two
vertices has a homogeneous pair (P, Q) with $|P|, |Q| \geq \delta |V(G)|$. It is not hard to show (see, e.g., [1], [?]) that
if a graph class $\mathcal{G}$ has the strong Erdős-Hajnal property, then it also has the (weak) Erdős-Hajnal property.

We shall prove Theorem 1.3 by proving that a more general graph class has the strong Erdős-Hajnal property.
A double k-hook, denoted by $H_k^2$, is the graph on k + 8 vertices $\{v_1, \ldots, v_{k+8}\}$, where the vertices $\{v_1, \ldots, v_{k+6}\}$
form a (k + 6)-vertex path, and $v_3v_{k+7}$ and $v_{k+4}v_{k+8}$ are pendant edges. Again, we prefer to view this graph
as a k-vertex path, with a hook attached to each end of the path: hence the name. For an illustration, see
Figure 1b.


Figure 1: (a) A k-hook. (b) A double k-hook.
Let $\mathcal{H}_{\geq k} := \{H_\ell^2, (H_\ell^2)^c : \ell \geq k\}$, i.e. the set of double ℓ-hooks and their complements for all ℓ ≥ k.
Since the class of $\{H_k, H_k^c\}$-free graphs is a subclass of the class of $\mathcal{H}_{\geq k}$-free graphs, Theorem 1.3 is implied by the following.


Theorem 1.4. For every k ≥ 1, the class of $\mathcal{H}_{\geq k}$-free graphs has the strong Erdős-Hajnal property.


Note that the result in [8] mentioned earlier, that the class of $\{P_k, P_k^c\}$-free graphs has the Erdős-Hajnal
property, is in fact proved by showing that $\{P_k, P_k^c\}$-free graphs have the strong Erdős-Hajnal property.
Furthermore, Bonamy, Bousquet and Thomassé [5] show that $\mathcal{G}_k$ has the strong Erdős-Hajnal property, where
$\mathcal{G}_k$ is the class of graphs that do not contain the cycle $C_\ell$ on ℓ vertices or its complement $C_\ell^c$ as an induced
subgraph for all ℓ ≥ k.

In the course of the paper, we shall prove that two further hereditary graph classes have the strong
Erdős-Hajnal property. We believe these results may be of independent interest. A hole in a graph is an
induced cycle of length at least 4 and an antihole is the complement of such a graph. A Berge graph is a
graph that does not contain any odd hole or odd antihole. It follows easily from the Strong Perfect Graph
Theorem that the class of Berge graphs satisfies the (weak) Erdős-Hajnal property, but a certain random
poset construction [23] shows that it does not satisfy the strong Erdős-Hajnal property. However, if we also
forbid the claw, i.e. the star on four vertices, then the strong Erdős-Hajnal property holds.


Theorem 1.5. The class of claw-free Berge graphs has the strong Erdős-Hajnal property.

In [30], Lagoutte and Trunck show that another subclass of Berge graphs has the strong Erdős-Hajnal
property. This class of graphs is incomparable to the class of claw-free Berge graphs.

The line graph L(G) of a graph G is the graph with vertex set E(G) where ef is an edge in L(G) if and
only if e and f share a vertex in G. While the class of line graphs is a proper subclass of the class of claw-free
graphs, it is incomparable to the class of claw-free Berge graphs, so the result below gives another hereditary
class for which the strong Erdős-Hajnal property holds.


Theorem 1.6. The class of line graphs has the strong Erdős-Hajnal property.

In fact we shall require weighted versions of Theorem 1.5 and Theorem 1.6, which we state and prove in
Section 3.

We remark that although the strong Erdős-Hajnal property implies the (weak) Erdős-Hajnal property,
there are graphs H such that the class $\mathcal{G}$ of H-free graphs satisfies the Erdős-Hajnal property, yet the
class of $\{H, H^c\}$-free graphs (and thus $\mathcal{G}$) fails to satisfy the strong Erdős-Hajnal property. The bull, a
self-complementary graph, is such an example, as implied by [15] and the following.

Theorem 1.7. Let H be a graph.

(a) The class of H-free graphs has the strong Erdős-Hajnal property if and only if H is an induced subgraph
of the four-vertex path $P_4$.

(b) If both H and its complement $H^c$ contain a cycle, then the class of $\{H, H^c\}$-free graphs does not have
the strong Erdős-Hajnal property.

We expect that the result above, which is proved by a simple random construction, is probably known, but
we cannot find it recorded anywhere. We give the details in Section 7.

While the strong Erdős-Hajnal property requires homogeneous pairs of linear size, if we require
homogeneous pairs of only polynomial size, then some strong results are known. In [19], Erdős, Hajnal and Pach
improve results from [?] and show that for every graph H there exists c > 0 such that every H-free graph G
on n vertices admits a homogeneous pair (P, Q) with $|P|, |Q| \geq n^c$. Fox and Sudakov [26] showed that in fact,
every H-free graph G contains either a clique of size $n^c$ or an anti-adjacent pair (P, Q) with $|P|, |Q| \geq n^c$.

Finally, we would like to remark that, as shown by Bousquet, Lagoutte, and Thomassé [7], if a hereditary
graph class satisfies the strong Erdős-Hajnal property, then it also admits a clique-independent set
separation family of polynomial size (for precise definitions we refer the reader to [7]). Consequently, the latter
conclusion holds for the family of $\mathcal{H}_{\geq k}$-free graphs for every fixed k ≥ 1. We point out that a conjecture
of Yannakakis [36], stemming from communication complexity and asserting that every graph admits a
clique-independent set separation family of polynomial size, was very recently disproved by Göös [?].
Inspiration and methods. The original inspiration for our work comes from a paper of Lokshtanov,
Vatshelle, and Villanger [32], who used the framework of minimal separators and potential maximal cliques
to give a polynomial-time algorithm for the (algorithmic) Independent Set problem in $P_5$-free graphs. Below
we give some background to this framework without defining all the notions we make reference to; however,
we emphasize that the rest of the paper stands independently from this section.

A minimal separator in a connected graph is an inclusion-wise minimal set of vertices whose deletion 
leaves the graph disconnected. A minimal triangulation of a graph G is an inclusion-wise minimal set of 
edges F such that G + F, the graph obtained by adding the edges of F to G, is a chordal graph. A potential 
maximal clique of G is a set $K \subseteq V(G)$ which is a maximal clique in G + F for some minimal triangulation
F. These notions all turn out to be closely related through the notion of treewidth and tree decompositions. 

Bouchitté and Todinca [6] studied the notion of potential maximal cliques from this perspective and
showed that the natural dynamic programming algorithm for finding a maximum independent set in a graph
of bounded treewidth can be modified to find such a set in time polynomial in the size of G and linear
in the number of potential maximal cliques in G. In this way, they obtained a unified explanation for the
existence of polynomial-time algorithms for the Independent Set problem in many hereditary graph classes.
The work for $P_5$-free graphs [32] follows the same approach, but generalises it, by showing that in $P_5$-free
graphs one needs to examine only a particular (polynomially-sized) set of potential maximal cliques in the
aforementioned algorithm. Subsequent work [?] uses minimal separators and potential maximal cliques in a
different way to develop a quasipolynomial-time algorithm for the Independent Set problem in $P_6$-free graphs.

Since the framework of minimal separators and potential maximal cliques has been successfully applied
to the Independent Set problem for various hereditary graph classes, we wished to investigate to what extent
these methods are useful for problems related to the Erdős-Hajnal conjecture. While our original proof of
Theorem 1.3 followed this framework closely, we eventually found a simpler proof which circumvents most
of the theory, although some artefacts remain.

Suppose H is a fixed graph and $\mathcal{G}$ is the class of $\{H, H^c\}$-free graphs. Our first observation, which is
essentially expressed in Lemma 4.2 but also requires a result from [25], is the following. If (for a contradiction)
the strong Erdős-Hajnal property does not hold for $\mathcal{G}$, then we may assume that each n-vertex graph $G \in \mathcal{G}$
has maximum degree o(n) and a minimal separator of linear size. It immediately follows that G has three
disjoint subsets of vertices A, B, S where S has linear size, where every vertex of S has at least one neighbour
in A and at least one neighbour in B, and where there are no edges between A and B.

The next step is to use this additional structure of G to form a hook, i.e. an induced path on four vertices, and
to grow a k-vertex induced path from the third vertex of the hook. This gives an induced copy of $H_k$ and the
desired contradiction. In order to obtain the k-vertex induced path, we apply the simple but ingenious argument
of Bousquet, Lagoutte, and Thomassé [8] that allows one to grow an arbitrarily long (but constant size) induced
path in a connected graph with sublinear maximum degree. The main work in our proof is to set up the hook
so that it will not interfere with the path we wish to grow. If such a hook does not exist, then an involved
analysis of vertices in S and how their neighbourhoods interact reveals that S has quite a restricted structure:
in particular we can partition a large part of S such that each pair of parts forms a homogeneous pair and such
that the 'quotient graph' of this partition belongs to a more restricted hereditary graph class than the one we
started with. This allows us to push through an induction step which gives a linear-sized homogeneous pair.


Structure. The rest of the paper is organised as follows. In Section 2 we provide all necessary definitions
and tools that we use throughout the paper. In Section 3 we prove weighted versions of Theorem 1.5 and
Theorem 1.6. In Section 4 we prove Theorem 1.4 using Theorem 3.1 and a structural result (cf. Theorem 4.3)
which we prove in Section 6. As a warm-up for this technical result and to illustrate our method, we prove
a simpler result in Section 5 (cf. Theorem 4.4). In Section 7 we prove Theorem 1.7. We close the paper
with some concluding remarks in Section 8.
2 Preliminaries


In this section, we fix notation and terminology, and we prove a lemma which we will use several times 
throughout the paper. 

A graph G = (V, E) consists of a set V(G) := V of vertices and a set E(G) := E of edges, where an edge
is an unordered pair of vertices. A multigraph is defined in the same way except that we allow E(G) to be
a multiset. A directed multigraph D = (V, A) consists of a vertex set V and an arc multiset A, where an arc
is an ordered pair of vertices. For graphs and multigraphs we set v(G) := |V(G)| and e(G) := |E(G)|. We
denote the complement of a graph G by $G^c$, where $V(G^c) := V(G)$ and $e \in E(G^c)$ if and only if $e \notin E(G)$. For
an edge $e \in E(G)$, we write $G \setminus e$ for the graph on the same vertex set as G and with edge set $E(G) \setminus \{e\}$.

Let $X \subseteq V(G)$ be a subset of the vertices of a graph G. We denote by G[X] the induced subgraph of G
on X, i.e. the graph with vertex set X and edge set $E(G[X]) := \{uv \in E(G) : u, v \in X\}$. We write $G - X$
for the induced subgraph of G on $V(G) \setminus X$. Let $N_G(X) := \{u \in V(G) \setminus X : uv \in E(G) \text{ for some } v \in X\}$
denote the (open) neighbourhood of X and let $N_G[X] := X \cup N_G(X)$ denote the closed neighbourhood of X.
We omit the subscript if the graph G is clear from context. We write N(v) := N({v}) and N[v] := N[{v}].
Furthermore, for a set $A \subseteq V(G)$ we define $N_A(X) := N(X) \cap A$ and $N_A[X] := N[X] \cap A$. For brevity, if
X = {x, y}, we write N(x, y) instead of N({x, y}). For a graph G and two disjoint sets $X, Y \subseteq V(G)$, we
denote by $E_G(X, Y)$ the set of edges of G with one endpoint in X and one endpoint in Y.

A graph H is called a subgraph of G, denoted by $H \subseteq G$, if $V(H) \subseteq V(G)$ and $E(H) \subseteq E(G[V(H)])$. It
is called an induced subgraph of G if E(H) = E(G[V(H)]). A k-vertex path, denoted by $P_k$, is the graph on
k vertices $\{v_1, \ldots, v_k\}$ with edge set $E(P_k) := \{v_iv_{i+1} : 1 \leq i < k\}$. We call a graph G connected if for every
pair of vertices x, y there exists a k-vertex path P for some k ≥ 1 that is a subgraph of G and that contains
both x and y. A component in G is a maximal connected subgraph of G.

A complete graph is one where all possible edges are present. A clique in a graph G is a subset $X \subseteq V(G)$
such that G[X] is a complete graph and an independent set in G is a subset $X \subseteq V(G)$ such that G[X] is
the empty graph. In each case, we may also refer to the subgraph G[X] as a clique or independent set.

We already defined the line graph of a graph, and more generally if G is a multigraph, the line graph
L(G) of G is the graph with vertex set E(G) where ef is an edge of L(G) if and only if e and f share a
vertex in G. A graph G' is called a line graph if it is the line graph of some (multi)graph G.
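The definition of the line graph of a multigraph can be made concrete as follows. This Python sketch is an illustration only; the function name `line_graph` is hypothetical, and edges of the multigraph are identified by their position in the input list so that parallel edges remain distinct vertices of the line graph.

```python
from itertools import combinations

def line_graph(multi_edges):
    """Line graph of a multigraph given as a list of (u, v) pairs.

    Vertices of the line graph are the indices of the edges, so parallel
    edges stay distinct; two indices are adjacent if the edges share an endpoint.
    """
    return {
        (i, j)
        for (i, e), (j, f) in combinations(enumerate(multi_edges), 2)
        if set(e) & set(f)
    }

# A star with three leaves: its line graph is a triangle.
star = [("c", "a"), ("c", "b"), ("c", "d")]
print(sorted(line_graph(star)))  # [(0, 1), (0, 2), (1, 2)]
```

As the star example suggests, all edges incident to a single vertex of the multigraph form a clique in the line graph.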

Given four distinct vertices x, a, b, c of a graph G, we say that (x; a, b, c) is a claw in G, if G[{x, a, b, c}]
is isomorphic to a claw with x being the degree-three vertex.

Hooks. For k ≥ 0, recall that a k-hook, denoted by $H_k$, is a (k + 3)-vertex path, say on vertex set
$\{v_1, \ldots, v_{k+3}\}$ and edges $v_iv_{i+1}$ for $1 \leq i \leq k + 2$, together with a pendant edge $v_{k+1}v_{k+4}$. The vertex $v_1$
is called the active vertex of the k-hook. Note that a 0-hook, denoted $H_0$, is the four-vertex path $P_4$, with
one of its interior vertices designated as an active vertex.

When constructing an induced k-hook in a graph G we often start with a 0-hook, i.e. a copy of an induced
$P_4$, and then "grow" a path by successively adding edges at the active vertex. The following notion is helpful.
An active k-hook in a graph G is a pair (X, R), where $X, R \subseteq V(G)$, $X \cap R = \emptyset$, G[X] is isomorphic to a k-hook,
G[R] is connected, and $N(R) \cap X$ consists of exactly one vertex, namely the active vertex of the k-hook G[X].

Modules. Frequently, we will encounter sets in our graph G that "behave like a single vertex" in the
following way. A set $X \subseteq V(G)$ is a module in G if for every $x, y \in X$ we have $N(x) \setminus X = N(y) \setminus X$. For
a partition $V(G) = X_1 \uplus X_2 \uplus \cdots \uplus X_r$ into nonempty modules $X_1, \ldots, X_r$, observe that, for every $i \neq j$,
the pair $(X_i, X_j)$ is a homogeneous pair. For such a partition, the quotient graph $G_q$ is defined to be the
graph with vertex set $\{X_1, X_2, \ldots, X_r\}$ where two sets $X_i$ and $X_j$ are connected by an edge in $G_q$ if and
only if they form an adjacent pair in G. Note that a quotient graph is necessarily isomorphic to some induced
subgraph of G, namely one formed by taking exactly one vertex from every set $X_i$.
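The notions of module and quotient graph can be checked mechanically on small examples. The Python sketch below is illustrative only; the names `neighbours`, `is_module` and `quotient_graph` are hypothetical, edges are stored as a set of two-element frozensets, and parts of the partition are referred to by their index.

```python
def neighbours(v, edges):
    """Open neighbourhood of v; edges is a set of two-element frozensets."""
    return {u for e in edges if v in e for u in e if u != v}

def is_module(X, edges):
    """X is a module if all its vertices have the same neighbours outside X."""
    outside = [neighbours(x, edges) - set(X) for x in X]
    return all(nb == outside[0] for nb in outside)

def quotient_graph(parts, edges):
    """Edges of the quotient graph for a partition into modules.

    Parts i and j are joined iff they form an adjacent pair; for modules it is
    enough to test one representative from each part.
    """
    assert all(is_module(p, edges) for p in parts)
    reps = [next(iter(p)) for p in parts]
    return {
        (i, j)
        for i in range(len(parts))
        for j in range(i + 1, len(parts))
        if frozenset((reps[i], reps[j])) in edges
    }

# Complete bipartite graph K_{2,2} plus an isolated vertex 4.
edges = {frozenset(e) for e in [(0, 2), (0, 3), (1, 2), (1, 3)]}
parts = [{0, 1}, {2, 3}, {4}]
print(quotient_graph(parts, edges))  # {(0, 1)}
```

Testing a single representative per part mirrors the remark that the quotient graph is isomorphic to the induced subgraph on one vertex from every part.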

The following simple lemma is used frequently throughout the paper. 

Lemma 2.1. Let G be a graph, μ be a probability measure on V(G), δ > 0, and $X \subseteq V(G)$ such that
$\mu(X) \geq 3\delta$. Then there exists either a set $P \subseteq X$ such that $(P, X \setminus P)$ is an anti-adjacent pair and
$\mu(P), \mu(X \setminus P) \geq \delta$, or the largest component of G[X] has measure at least $\mu(X) - \delta$.
Proof. Let C be the vertex set of a component of G[X] such that μ(C) is maximal. If $\mu(C) \geq \mu(X) - \delta$, then
we are done. Also, if $\delta \leq \mu(C) < \mu(X) - \delta$, then we are done by taking P := C. In the remaining case, when
all components of G[X] have measure less than δ, we proceed as follows. We initialise $P := \emptyset$, and iterate over
components of G[X] one by one, putting them into the set P until μ(P) reaches δ. Since every component
of G[X] has measure less than δ, we have $\delta \leq \mu(P) < 2\delta$ at the end of the process. Since $\mu(X) \geq 3\delta$, we have
$\mu(X \setminus P) > \delta$. Furthermore, by construction, P and $X \setminus P$ form an anti-adjacent pair. This concludes the
proof of the lemma. □
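The case analysis in the proof of Lemma 2.1 is effectively an algorithm. The following Python sketch follows it step by step; it is an illustration under the stated assumption $\mu(X) \geq 3\delta$, and the names `components` and `split_or_giant` as well as the dictionary `mu` mapping vertices to their measure are hypothetical.

```python
from fractions import Fraction

def components(X, edges):
    """Connected components of the subgraph induced on X."""
    X, comps = set(X), []
    while X:
        stack, comp = [X.pop()], set()
        while stack:
            v = stack.pop()
            comp.add(v)
            nbrs = {u for e in edges if v in e for u in e if u in X}
            X -= nbrs                  # mark neighbours as discovered
            stack.extend(nbrs)
        comps.append(comp)
    return comps

def split_or_giant(X, edges, mu, delta):
    """Follow the proof of Lemma 2.1; assumes mu(X) >= 3*delta."""
    mass = lambda S: sum(mu[v] for v in S)
    comps = components(X, edges)
    C = max(comps, key=mass)
    if mass(C) >= mass(X) - delta:
        return ("giant", C)
    if mass(C) >= delta:
        return ("pair", C, set(X) - C)
    P = set()
    for comp in comps:                 # every component is lighter than delta
        P |= comp
        if mass(P) >= delta:
            break
    return ("pair", P, set(X) - P)

# Six isolated vertices of measure 1/6 each, delta = 1/4.
mu = {v: Fraction(1, 6) for v in range(6)}
result = split_or_giant(range(6), set(), mu, Fraction(1, 4))
print(result[0], len(result[1]))  # pair 2
```

Since P is always a union of whole components of G[X], the returned pair is anti-adjacent by construction.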

3 Line graphs and claw-free Berge graphs 

In this section, we state and prove weighted versions of Theorem 1.5 and Theorem 1.6, from which those
theorems immediately follow.

A graph class $\mathcal{G}$ has the weighted strong Erdős-Hajnal property if there exists a constant δ > 0 such that
every $G \in \mathcal{G}$ satisfies the following property. For every probability measure μ on V(G) satisfying $\mu(v) \leq 1 - 2\delta$
for all $v \in V(G)$, there exists a homogeneous pair (P, Q) in G with $\mu(P), \mu(Q) \geq \delta$. The condition
that $\mu(v) \leq 1 - 2\delta$ for all $v \in V(G)$ is necessary since for degenerate measures, where most of the mass is
concentrated on one vertex only, we cannot hope to find a homogeneous pair of sufficient mass, for any graph G.

We shall prove the following two theorems. 

Theorem 3.1. The class of claw-free Berge graphs has the weighted strong Erdős-Hajnal property.
Theorem 3.2. The class of line graphs has the weighted strong Erdős-Hajnal property.

Theorem 3.2 is an immediate corollary of the next lemma, where we prove that in any line graph with
vertex weights, we find either an anti-adjacent pair or a clique of positive mass.

Lemma 3.3. Let $\delta_1 = 1/14$. Then for every graph G that is a line graph of some multigraph H, and every
probability measure μ on V(G), there exists either a clique K in G with $\mu(K) \geq 3\delta_1$ or an anti-adjacent pair
(P, Q) in G with $\mu(P), \mu(Q) \geq \delta_1$.

Proof. Fix a multigraph H, let G be its line graph, and fix a probability measure μ on V(G). By definition,
μ is a probability measure on the edges of H. We can find a partition $V(H) = L \uplus R$ such that
$w := \mu(E_H(L, R)) \geq \frac{1}{2}$. Such a partition exists since for a uniformly random partition $V(H) = V_1 \uplus V_2$
the expected value of $\mu(E_H(V_1, V_2))$ is $\frac{1}{2}$. Let H' be the bipartite subgraph of H with V(H') = V(H) and
$E(H') = E_H(L, R)$. Notice that the line graph of H' is an induced subgraph of G. We define a function
$f : V(H) \to [0, 1]$ by $f(v) = \sum_{u : uv \in E(H')} \mu(uv)$. We naturally extend f to subsets of V(H) by summation
over the elements of the subset. Notice that $w = f(L) = f(R) \geq \frac{1}{2}$.

Assume that G has no clique of measure at least $3\delta_1$. Since the set of edges incident to a single vertex
in H' forms a clique in G, we may deduce that $f(v) < 3\delta_1$ for every vertex v in H'. We find a partition
$L = L_1 \uplus L_2$ of L such that $\frac{w}{2} - \frac{3}{2}\delta_1 \leq f(L_1), f(L_2) \leq \frac{w}{2} + \frac{3}{2}\delta_1$ in the following way. Pick $u \in L$, set $L_1 := \{u\}$,
and add vertices from L to $L_1$, one at a time, until $f(L_1) \geq \frac{w}{2} - \frac{3}{2}\delta_1$. Since we add less than $3\delta_1$ to $f(L_1)$ each
time, the upper bound on $f(L_1)$ follows, and the bounds on $f(L_2) = w - f(L_1)$ follow from those on $f(L_1)$.
In the same way, we find a partition $R = R_1 \uplus R_2$ of R such that
$\frac{w}{2} - \frac{3}{2}\delta_1 \leq f(R_1), f(R_2) \leq \frac{w}{2} + \frac{3}{2}\delta_1$.

For $i, j \in \{1, 2\}$, let $p_{ij} := \mu(E_{H'}(L_i, R_j))$. We have $p_{11} + p_{12} = f(L_1) \geq \frac{w}{2} - \frac{3}{2}\delta_1$. Likewise,
$p_{11} + p_{21} = f(R_1) \geq \frac{w}{2} - \frac{3}{2}\delta_1$. The analogous bounds hold for $p_{21} + p_{22} = f(L_2)$ and $p_{12} + p_{22} = f(R_2)$.

If $p_{12} < \delta_1$ or $p_{21} < \delta_1$, then both $p_{11}, p_{22} > \frac{w}{2} - \frac{5}{2}\delta_1 \geq \delta_1$. In that case, we may take $P =
E_{H'}(L_1, R_1)$, $Q = E_{H'}(L_2, R_2)$, since then (P, Q) is an anti-adjacent pair in G. In the other case, if
$p_{12}, p_{21} \geq \delta_1$, we may take $P = E_{H'}(L_1, R_2)$ and $Q = E_{H'}(L_2, R_1)$, and again observe that (P, Q) is an
anti-adjacent pair in G. □
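The proof above obtains the partition $V(H) = L \uplus R$ from an expectation argument. As an aside, such a partition can also be found deterministically by a standard greedy procedure (the method of conditional expectations); this is a swapped-in technique and not the argument used in the proof. The Python sketch below is illustrative only, and the name `heavy_cut` and the input format (a list of weighted edges without loops) are assumptions.

```python
from fractions import Fraction

def heavy_cut(vertices, weighted_edges):
    """Greedy bipartition V = L + R whose cut carries at least half the weight.

    weighted_edges: list of (u, v, weight) with u != v; parallel edges allowed.
    Each vertex joins the side opposite to where most of its already-placed
    neighbour weight sits, so at least half of its back-edge weight is cut.
    """
    side = {}
    for v in vertices:
        to_L = sum(w for a, b, w in weighted_edges
                   if (a == v and side.get(b) == "L") or (b == v and side.get(a) == "L"))
        to_R = sum(w for a, b, w in weighted_edges
                   if (a == v and side.get(b) == "R") or (b == v and side.get(a) == "R"))
        side[v] = "R" if to_L >= to_R else "L"
    cut = sum(w for a, b, w in weighted_edges if side[a] != side[b])
    return side, cut

# A triangle whose three edges each carry measure 1/3.
third = Fraction(1, 3)
triangle = [(0, 1, third), (1, 2, third), (0, 2, third)]
side, cut = heavy_cut([0, 1, 2], triangle)
print(cut)  # 2/3
```

Every edge is considered exactly once, namely when its later endpoint is placed, which is why the total cut weight is at least half of the total weight.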

We now turn to the proof of Theorem 3.1. We resort to some known structural results on claw-free graphs.
Let us first recall some standard terminology that we need. For a graph G, a pair (T, β) is called a tree
decomposition of G if T is a tree, $\beta : V(T) \to 2^{V(G)}$ is a function, and the following conditions hold:
1. $V(G) = \bigcup_{t \in V(T)} \beta(t)$;

2. for every $v \in V(G)$ the set $\{t : v \in \beta(t)\}$ induces a connected subgraph of T;

3. for every edge $uv \in E(G)$ there exists $t \in V(T)$ such that $u, v \in \beta(t)$.

For a tree decomposition (T, β) and a node $t \in V(T)$, the set β(t) is called a bag. A subset $S \subseteq V(G)$
is called a separator (of G) if there exist two vertices $x, y \in V(G) \setminus S$ such that x and y lie in different
components of $G - S$. A clique separator in G is a set $S \subseteq V(G)$ that is a separator of G and such that G[S]
forms a clique. The following result on the existence of a clique separator decomposition is considered to
be folklore, see e.g. [?]. Since in most references it is phrased as a recursive graph decomposition instead
of a tree decomposition, we provide the short proof for completeness.
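The three conditions in the definition of a tree decomposition can be verified directly for small instances. The Python sketch below is illustrative only; the name `is_tree_decomposition` and the representation of bags as a dictionary from tree nodes to vertex sets are assumptions, and the function takes for granted that `tree_edges` really forms a tree on the given nodes.

```python
def is_tree_decomposition(graph_vertices, graph_edges, tree_edges, bags):
    """Check conditions 1-3 for bags: dict node -> set of graph vertices.

    Assumes `tree_edges` really forms a tree on the keys of `bags`.
    """
    # 1. every vertex lies in some bag
    if set().union(*bags.values()) != set(graph_vertices):
        return False
    # 3. every edge is contained in some bag
    if not all(any({u, v} <= bag for bag in bags.values()) for u, v in graph_edges):
        return False
    # 2. the nodes whose bags contain v induce a connected subtree
    for v in graph_vertices:
        nodes = {t for t, bag in bags.items() if v in bag}
        reached, frontier = set(), [next(iter(nodes))]
        while frontier:
            t = frontier.pop()
            if t in reached:
                continue
            reached.add(t)
            for a, b in tree_edges:
                if a == t and b in nodes:
                    frontier.append(b)
                elif b == t and a in nodes:
                    frontier.append(a)
        if reached != nodes:
            return False
    return True

# The path 0-1-2-3 with one bag per edge, arranged along a path a-b-c.
path = [(0, 1), (1, 2), (2, 3)]
good = {"a": {0, 1}, "b": {1, 2}, "c": {2, 3}}
print(is_tree_decomposition(range(4), path, [("a", "b"), ("b", "c")], good))  # True
```

Swapping the bags of the nodes b and c in this example violates condition 2 for the vertex 1, and the check then fails.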


Lemma 3.4. For every graph G there exists a tree decomposition of G where every bag induces a graph 
without clique separators. 


Proof. We prove the claim by induction on |V(G)|. In the base case, G does not contain any clique separator,
so we can create a tree decomposition (T, β) where T consists of a single node t and β(t) := V(G).

Otherwise, let S be a clique separator in G such that |S| is minimal. By minimality, there exists a
component A of $G - S$ such that $N_G(A) = S$. Let $G_1 := G[A \cup S]$ and $G_2 := G - A$. By induction, for
i = 1, 2, there exists a suitable tree decomposition $(T_i, \beta_i)$ of the graph $G_i$. Since G[S] is a clique, and S appears
both in $G_1$ and $G_2$, for every i = 1, 2, there exists a node $t_i \in V(T_i)$ such that $S \subseteq \beta_i(t_i)$. To conclude, let
T be the tree formed by taking the disjoint union of $T_1$ and $T_2$ and adding the edge $t_1t_2$. Set $\beta(t) := \beta_i(t)$
if $t \in V(T_i)$, and observe that (T, β) is a suitable tree decomposition of G. □


The following lemma provides the main reason for considering tree decompositions (with additional suitable
properties) when studying the strong Erdős-Hajnal property. It is considered folklore in the unweighted case,
and we refer to it as the central bag argument.

Lemma 3.5. Let $0 < \delta \leq \frac{1}{4}$ be a constant, let G be a graph, let μ be a probability measure on V(G), and let
(T, β) be a tree decomposition of G. Then there exists an anti-adjacent pair (P, Q) in G with $\mu(P), \mu(Q) \geq \delta$,
or a bag β(t) with $\mu(\beta(t)) \geq \frac{1}{2} - \delta$.


Proof. Extend β to subsets of nodes of T by setting $\beta(S) := \bigcup_{t \in S} \beta(t)$ for every $S \subseteq V(T)$. We define an
orientation of the edges of T as follows. For an edge $t_1t_2 \in E(T)$, let $T_i$ be the component of $T \setminus t_1t_2$ that
contains $t_i$, for i = 1, 2. Now, orient the edge $t_1t_2$ from $t_1$ to $t_2$ if $\mu(\beta(V(T_1))) \leq \mu(\beta(V(T_2)))$, and orient the
edge $t_1t_2$ from $t_2$ to $t_1$ otherwise. Since the tree T has fewer edges than nodes, there exists a node $t \in V(T)$
of out-degree zero. For every component C of $G - \beta(t)$ there exists a component $T_C$ of $T - \{t\}$ such that
$C \subseteq \beta(V(T_C))$, by the properties of the tree decomposition. Therefore, $\mu(C) \leq \mu(\beta(V(T_C))) \leq \frac{1}{2}$ since the
edge between $T_C$ and t is oriented towards t.

If $\mu(\beta(t)) \geq \frac{1}{2} - \delta$, then we are done, so assume otherwise. Note that then $\mu(V(G) \setminus \beta(t)) > \frac{1}{2} + \delta \geq 3\delta$
since, by assumption, $\delta \leq \frac{1}{4}$. Therefore, by Lemma 2.1 there is an anti-adjacent pair (P, Q) in $V(G) \setminus \beta(t)$
such that $\mu(P), \mu(Q) \geq \delta$, or there is a component C in $G - \beta(t)$ with $\mu(C) \geq \mu(V(G) \setminus \beta(t)) - \delta > \frac{1}{2}$. Since
the second outcome is a contradiction, we indeed find an anti-adjacent pair of desired size. □


The previous two lemmas allow us to pass to a linear-sized subset $Y \subseteq V(G)$ of a graph G with the additional
property that G[Y] has no clique separators, provided that G has no anti-adjacent pair of linear size. In light
of Theorem 3.1 we search for a good characterisation of claw-free Berge graphs. Chvátal and Sbihi [?] show
that a claw-free graph without clique separators is Berge if and only if it is either "elementary" or "peculiar".
A graph is called elementary if its edges can be coloured by two colours in such a way that edges xy and
yz have distinct colours whenever x and z are nonadjacent. We decide not to give the exact definition of
a peculiar graph here, but rather we point out that the vertex set of a peculiar graph can be partitioned
into nine parts that each form a clique. The following is then an immediate implication of Theorem 2 in [?].
Theorem 3.6. Let G be a graph that is claw-free, Berge and that has no clique separator. Then G is either
elementary or the vertex set V(G) can be partitioned into nine sets $V(G) = \bigcup_{i=1}^{9} V_i$ such that $G[V_i]$ is a
clique for each $1 \leq i \leq 9$.

We now resort to a characterisation of elementary graphs due to Maffray and Reed [33] that suits our
purposes better than the original definition. We use the following terminology from [33]. Let G be a graph.
We call an edge a flat edge in G if it does not appear in any triangle of G. Let xy be a flat edge in G, let X, Y
be two disjoint sets such that $X \cap V(G) = Y \cap V(G) = \emptyset$, and let $B = (X, Y; E_{XY})$ be a cobipartite graph,
that is a graph on vertex set $X \uplus Y$, where B[X] and B[Y] form cliques, such that there is at least one edge
between X and Y in B. We can build a new graph G' obtained from $G - \{x, y\}$ and B by adding all possible
edges between X and $N(x) \setminus \{y\}$ and between Y and $N(y) \setminus \{x\}$. We say that G' is obtained by augmenting G along xy, that x
and y are augmented, and that x is replaced by X and y is replaced by Y. Intuitively, we replace the vertices x
and y by cliques, and the edge xy by a (non-empty) bipartite graph. It is easy to see that, if $x_1y_1$ and $x_2y_2$ are
independent edges in G, then the graph obtained by first augmenting G along $x_1y_1$ and then the resulting graph
along $x_2y_2$ is the same as if we had first augmented along $x_2y_2$ and then along $x_1y_1$. This leads to the following definition.

An augmentation of a graph G is a graph G' that is obtained by augmenting G along the edges of some 
matching of flat edges in G. 

Theorem 3.7 ([33]). A graph G is elementary if and only if it is an augmentation of a line graph of a 
bipartite multigraph. 

We now prove the equivalent of Lemma 3.3 for elementary graphs.

Lemma 3.8. Let $\delta_2 = \delta_1/2$. Then, for every graph G that is an elementary graph, and every probability
measure μ on V(G), there exists either a clique K in G with $\mu(K) \geq 3\delta_2$ or an anti-adjacent pair (P, Q) in
G with $\mu(P), \mu(Q) \geq \delta_2$.


Proof. Let G' be a line graph of a bipartite multigraph B such that G is an augmentation of G', which exists
by Theorem 3.7. We define a probability measure μ' on V(G') in the natural way by setting μ'(x) := μ(X)
if $x \in V(G')$ was augmented and replaced by X, and μ'(x) := μ(x) otherwise.

We apply Lemma 3.3 to G' to find either an anti-adjacent pair (P, Q) in G' with $\mu'(P), \mu'(Q) \geq \delta_1 = 2\delta_2$,
or a clique, say on vertex set K, such that $\mu'(K) \geq 3\delta_1$. In the first outcome, note that some vertices of
P and Q may have been replaced by cliques in the augmentation G, say $x_1, \ldots, x_p \in P$ are replaced by
$X_1, \ldots, X_p$, and $y_1, \ldots, y_q \in Q$ are replaced by $Y_1, \ldots, Y_q$. Set $P_G := (P \setminus \{x_1, \ldots, x_p\}) \cup X_1 \cup \cdots \cup X_p$ and
$Q_G := (Q \setminus \{y_1, \ldots, y_q\}) \cup Y_1 \cup \cdots \cup Y_q$, and note that $(P_G, Q_G)$ is an anti-adjacent pair in G with $\mu(P_G), \mu(Q_G) \geq 2\delta_2$.
In the second outcome, we consider two cases, depending on |K|. If $|K| \leq 2$, then the heaviest vertex of
K corresponds to a clique in G of measure at least $3\delta_1/2 = 3\delta_2$. Otherwise, if K consists of at least three
vertices, then none of its edges is flat, and therefore it remains a clique of measure at least $3\delta_1$ in G. □


We now deduce Theorem 3.1 as a corollary.


Proof of Theorem \3.1\ We prove the weighted strong Erdos-Hajnal property with constant 83 = A. Let G 
be a graph that is claw-free and Berge, and let μ be a probability measure on V(G) such that for every
vertex v of G we have that μ(v) < 1 − 2δ_3. In fact, we may assume that μ(v) < δ_3 for all v ∈ V(G). For
let v ∈ V(G) be a vertex such that δ_3 ≤ μ(v) < 1 − 2δ_3. Then μ(N(v)) ≥ δ_3 or μ(V(G) \ N[v]) ≥ δ_3. That
is, either ({v}, N(v)) is an adjacent pair in G of sufficient mass, or ({v}, V(G) \ N[v]) is an anti-adjacent
pair in G of sufficient mass, and we are done.

Let (T, β) be a tree-decomposition of G such that every bag induces a subgraph of G without a clique
separator, which exists by Theorem 3.4. By Lemma 3.5, either there is an anti-adjacent pair (P, Q) in G


with y(P),y(Q) > S 3 and we are done, or there is a bag Y with y(Y) > — <5 3 . In the second case, we apply 


Theorem 3.6 to G[Y] to infer that G[Y] is either elementary or its vertex set can be partitioned into nine
cliques. In the latter case, G[Y] contains a clique of measure at least μ(Y)/9 ≥ 3δ_3 and we are done. If G[Y] is
elementary, then by Lemma 3.8 G[Y] contains an anti-adjacent pair (P, Q) with μ(P), μ(Q) ≥ δ_2 μ(Y) or a
clique K with μ(K) ≥ 3δ_2 μ(Y). In both cases we are done by the choice of δ_3, as δ_2 μ(Y) ≥ δ_3. □









4 Double hooks have the strong Erdős-Hajnal property

In this section, we state two technical lemmas and show how Theorem 1.4 can be derived from them. The
lemmas will be proved in Sections 5 and 6.

Fix k ≥ 1 and let 𝒢 := {G : G is ℋ_{≥k}-free}, i.e. the class of all graphs G that are {H_ℓ, (H_ℓ)^c}-free, for
all ℓ ≥ k. To prove Theorem 1.4 we show that 𝒢 has the strong Erdős-Hajnal property. That is, we need to
find a δ > 0 such that every G ∈ 𝒢 contains a homogeneous pair (P, Q) with |P|, |Q| ≥ δ · v(G). Similarly as
in [5], our starting point is to pass down to an induced subgraph that is very sparse or very dense. The edge
density of a graph G is the fraction e(G)/(v(G) choose 2). The following is due to Fox and Sudakov [25], improving
an earlier result of Rödl [35].

Theorem 4.1. For every 0 < ε < 1/2 and every graph H on at least two vertices there exists a constant
δ = δ(ε, H) such that every H-free graph on n vertices contains an induced subgraph on at least δn vertices
with edge density either at most ε or at least 1 − ε.
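
To make the notion of edge density concrete, here is a small sketch that computes it for a graph stored as a dictionary from vertices to neighbour sets. The function name `edge_density` and the data layout are assumptions made for illustration; they are not part of the paper.

```python
def edge_density(adj):
    """Edge density e(G) / (v(G) choose 2) of a simple undirected graph.

    adj -- dict: vertex -> set of neighbours
    """
    n = len(adj)
    if n < 2:
        raise ValueError("edge density needs at least two vertices")
    # Every edge is counted once from each endpoint.
    edges = sum(len(nb) for nb in adj.values()) // 2
    return edges / (n * (n - 1) / 2)
```

The density of the complement graph is one minus this value, which is why the sparse and dense outcomes of Theorem 4.1 are interchanged when passing from G to its complement.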


In case the induced subgraph is particularly sparse we find a special structure within it. Let G be a graph,
and let 𝒮 = (A, B, S) be a triple of non-empty subsets of V(G). We call the pair (G, 𝒮) an ε-structured pair if

1. A ⊎ B ⊎ S = V(G), i.e., the sets A, B, S form a partition of V(G);

2. G[A] and G[B] are connected;

3. N(A) = N(B) = S, in particular, there is no edge between A and B; and

4. for every v ∈ V(G) it holds that |N_S(v)| ≤ ε|S|.

Note that if (G, 𝒮) is ε-structured, then it is ε'-structured for every ε' ≥ ε.
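
The four conditions can be checked mechanically. The sketch below is one possible direct translation, assuming the graph is a dictionary from vertices to neighbour sets; the helper names (`is_connected`, `open_neighbourhood`, `is_structured_pair`) are hypothetical and chosen only for this illustration.

```python
from collections import deque


def is_connected(adj, U):
    """True if the induced subgraph G[U] is connected (U non-empty)."""
    U = set(U)
    if not U:
        return False
    start = next(iter(U))
    seen, queue = {start}, deque([start])
    while queue:
        v = queue.popleft()
        for w in adj[v] & U:
            if w not in seen:
                seen.add(w)
                queue.append(w)
    return seen == U


def open_neighbourhood(adj, U):
    """N(U): vertices outside U with at least one neighbour in U."""
    U = set(U)
    return set().union(*(adj[v] for v in U)) - U


def is_structured_pair(adj, A, B, S, eps):
    """Check conditions 1-4 of an eps-structured pair (G, (A, B, S))."""
    A, B, S = set(A), set(B), set(S)
    V = set(adj)
    if not (A and B and S):
        return False
    # 1. A, B, S partition V(G).
    if A | B | S != V or len(A) + len(B) + len(S) != len(V):
        return False
    # 2. G[A] and G[B] are connected.
    if not (is_connected(adj, A) and is_connected(adj, B)):
        return False
    # 3. N(A) = N(B) = S.
    if open_neighbourhood(adj, A) != S or open_neighbourhood(adj, B) != S:
        return False
    # 4. Every vertex has at most eps * |S| neighbours in S.
    return all(len(adj[v] & S) <= eps * len(S) for v in V)
```

The monotonicity remark is visible in the last line: raising `eps` can only turn a failing check into a passing one.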

Lemma 4.2. Fix 0 < e < A and let G be a graph on n vertices such that every vertex has at most en 
neighbours. Then 

(a) there exists a homogeneous pair (P, Q) in G with |P|, |Q| ≥ n/10; or

(b) there exist subsets A, B, S ⊆ V(G) such that |S| ≥ n/10 and the pair (G[A ∪ B ∪ S], (A, B, S)) is a
10ε-structured pair.


Proof. Assume that there is no homogeneous pair (P, Q) with |P|, |Q| ≥ n/10 in G, and let G_1 be the largest
component of G. By Lemma 2.1, G_1 has at least 9n/10 vertices. Pick an arbitrary vertex x_A in G_1, and
set A := {x_A}. Now add vertices one by one to A, keeping A connected, until |N[A]| exceeds n/2. Then
|N[A]| ≤ n/2 + εn ≤ 3n/5, since we add at most εn vertices to N[A] in each step. Thus, G_1 − N[A] has at
least 3n/10 vertices. Let B be the largest component in G_1 − N[A]. By Lemma 2.1 we may assume that
|B| ≥ n/10 and therefore we must also have that |A| < n/10 (otherwise (A, B) is an anti-adjacent pair of
sufficient size). Thus, |N(A)| ≥ 2n/5. Furthermore, |N(A) \ N[B]| < n/10, since otherwise (B, N(A) \ N[B])
is an anti-adjacent pair of sufficient size. Setting S := N(A) ∩ N(B), we see that |S| ≥ 3n/10 by the above
discussion, and for every v ∈ V(G) we have that |N_S(v)| ≤ |N(v)| ≤ εn ≤ 10ε|S|. Furthermore it is easy to
see that in G' = G[A ∪ B ∪ S] we have N(A) = N(B) = S and that G'[A] and G'[B] are connected. Thus
the pair (G[A ∪ B ∪ S], (A, B, S)) is a 10ε-structured pair. □
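
The greedy growth of A used in the proof above can be sketched as follows. This is an illustration only: the function name `grow_connected_set` is hypothetical, the graph is assumed to be a dictionary from vertices to neighbour sets, and vertices are assumed to be comparable so that `min` gives a deterministic choice (any vertex of the frontier would do).

```python
def grow_connected_set(adj, start, threshold):
    """Grow a connected set A from `start` until |N[A]| exceeds `threshold`.

    Returns (A, closed) where closed = N[A] is the closed neighbourhood.
    Stops early if the component of `start` is exhausted.
    """
    A = {start}
    closed = {start} | adj[start]
    while len(closed) <= threshold:
        frontier = closed - A          # vertices adjacent to A, not yet in A
        if not frontier:
            break                      # whole component already absorbed
        v = min(frontier)              # adding a neighbour keeps G[A] connected
        A.add(v)
        closed |= adj[v]               # at most deg(v) new vertices join N[A]
    return A, closed
```

Since each step enlarges N[A] by at most the maximum degree, the final closed neighbourhood has size at most the threshold plus the maximum degree, which is the bound n/2 + εn used in the proof.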


The following theorem is the crucial step in our proof. It states that within an ε-structured pair
(G, (A, B, S)), we either find the desired homogeneous pair of linear size, or a very structured subset Ŝ ⊆ S
of linear size, or an ℓ-hook for some ℓ ≥ k which we can potentially extend to a double ℓ-hook. Recall that an
active ℓ-hook in a graph G is a pair (X, R), where X, R ⊆ V(G), X ∩ R = ∅, G[X] is isomorphic to an ℓ-hook,
G[R] is connected, and N(R) ∩ X consists of exactly one vertex, being the active vertex of the ℓ-hook G[X].


Theorem 4.3. For every k > 0, there exists a constant ε_0 = ε_0(k) such that for every 0 < ε < ε_0 and in every
ε-structured graph (G, (A, B, S)) there exists either

1. an anti-adjacent pair (P, Q) in G with P, Q ⊆ S, |P|, |Q| ≥ ε|S|; or

2. an active ℓ-hook (X, R) in G with ℓ ≥ k, R ⊆ S, and |R| ≥ 2ε|S|; or

3. a subset Ŝ ⊆ S with |Ŝ| ≥ |S|/5 and a partition Ŝ = S_1 ⊎ S_2 ⊎ ... ⊎ S_m, for some m ≥ 2, such that

(a) |S_i| ≤ ε|S| for every 1 ≤ i ≤ m;

(b) every set S_i is a module of G[Ŝ]; and

(c) the quotient graph of this partition of the vertex set of G[Ŝ] is a claw-free Berge graph.

We delay the proof of this theorem until Section 6. Informally, the idea is as follows. Let (G, (A, B, S)) be
an ε-structured pair and assume that G does not contain a homogeneous pair of linear size or an active ℓ-hook.
After some filtering, we partition the vertices in S into equivalence classes according to their neighbourhoods
in A ∪ B. Assuming certain subgraphs like the hook are forbidden in G, it turns out that edges and non-edges
between pairs of vertices in S correspond to a certain behaviour of the neighbourhoods of those vertices in
A and B. This allows us to deduce that the equivalence classes of the partition on S are in fact modules.
Furthermore, the quotient graph turns out to have an even more restricted structure in terms of the induced
subgraphs that are forbidden.

We believe that these methods can be of further use to approach similar problems. Since the proof of
Theorem 4.3 is rather technical, we present the following as a warm-up in Section 5 to illustrate our methods,
although we will need many of the lemmas from Section 5 later.

Theorem 4.4. For every 1/10-structured graph (G, (A, B, S)) such that G is both claw-free and C_5-free,
there exists a subset Ŝ ⊆ S with |Ŝ| ≥ |S|/5 and a partition Ŝ = S_1 ⊎ S_2 ⊎ ... ⊎ S_ℓ such that:

1. every set S_i is contained in a neighbourhood of some vertex in A;

2. every set S_i is a module of G[Ŝ];

3. the quotient graph of this partition of the vertex set of G[Ŝ] is a line graph of a triangle-free graph.


We now prove our main result. 


Proof of Theorem 1.4. Note that if the theorem holds for k = r then it holds for all 1 ≤ k ≤ r, and so it
is sufficient to prove the theorem for all k ≥ 2. Thus, fix k ≥ 2 and let 𝒢 := {G : G is ℋ_{≥k}-free} and set
ε_0 = ε_0(k) to be the constant from Theorem 4.3. We shall prove the following claim.


Claim 4.5. Suppose G ∈ 𝒢 has n vertices and maximum degree at most ε_0 n/100. Then either G has a homogeneous pair
(P, Q) where |P|, |Q| ≥ ε_0 n/3000 or we can find an active ℓ-hook (X, R) for some ℓ ≥ k, where |R| ≥ ε_0 n/50.


Proof of Claim. By Lemma 4.2, either there is a homogeneous pair (P, Q) in G with |P|, |Q| ≥ n/10 (in
which case we are done) or there is an (ε_0/10)-structured pair (G[A ∪ B ∪ S], (A, B, S)) with |S| ≥ n/10.

Set G_1 := G[A ∪ B ∪ S]. By Theorem 4.3 there is either 1. an anti-adjacent pair (P, Q) in S with |P|, |Q| ≥
(ε_0/10)|S| ≥ (ε_0/100) n (in which case we are done); or 2. an active ℓ-hook (X, R) with ℓ ≥ k, R ⊆ S and |R| ≥ 2(ε_0/10)|S|;
or 3. a subset Ŝ ⊆ S with |Ŝ| ≥ |S|/5 and a partition Ŝ = S_1 ⊎ S_2 ⊎ ... ⊎ S_m, for some m ≥ 2, such that

(a) |S_i| ≤ (ε_0/10)|S| ≤ (ε_0/2)|Ŝ| for every 1 ≤ i ≤ m;

(b) every set S_i is a module of G[Ŝ]; and

(c) the quotient graph of this partition of the vertex set of G[Ŝ] is a claw-free Berge graph.

In the third outcome, we consider the quotient graph G_q that has vertex set V_q := {S_i : 1 ≤ i ≤ m},
and where S_iS_j forms an edge in G_q if and only if (S_i, S_j) is an adjacent pair. We define a probability
measure μ on V_q in the natural way by setting μ(S_i) := |S_i|/|Ŝ|. Note that, by Property (a), μ(S_i) ≤ ε_0/2
for every vertex S_i in G_q. By Property (c), the graph G_q is claw-free and Berge. We now invoke Theorem
3.1 to see that there is either a homogeneous pair (P_q, Q_q) in G_q with μ(P_q), μ(Q_q) ≥ δ_cb, or there is a
vertex S_i ∈ V_q with μ(S_i) ≥ 1 − 2δ_cb, where δ_cb is the constant for which Theorem 3.1 is proved in Section 3.
Since for every 1 ≤ i ≤ m we have that μ(S_i) ≤ ε_0/2 < 1 − 2δ_cb, the first outcome must hold for G_q.
Consider the sets P := ⋃_{S_i ∈ P_q} S_i and Q := ⋃_{S_i ∈ Q_q} S_i. Note that since the S_i's are modules and (P_q, Q_q)
is a homogeneous pair in G_q, then (P, Q) is a homogeneous pair in G[Ŝ] and hence in G. Furthermore
|P|, |Q| ≥ δ_cb|Ŝ| ≥ (δ_cb/50) · n ≥ ε_0 n/3000, giving us the homogeneous pair of the desired size.

Thus we may assume the second outcome holds, where we find an active ℓ-hook (X, R), for some ℓ ≥ k,
such that R ⊆ S and |R| ≥ 2(ε_0/10)|S| ≥ (ε_0/50) n as required. □
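
The objects used in the third outcome (modules, the quotient graph, and the measure on its vertices) can be sketched in code. This is an illustration under assumptions: graphs are dictionaries from vertices to neighbour sets, and the names `is_module`, `quotient_graph` and `part_measure` are hypothetical helpers, not notation from the paper.

```python
from fractions import Fraction


def is_module(adj, M, universe):
    """True if every vertex of universe \\ M sees all of M or none of M."""
    M, universe = set(M), set(universe)
    for v in universe - M:
        hits = len(adj[v] & M)
        if hits not in (0, len(M)):
            return False
    return True


def quotient_graph(adj, parts):
    """Quotient graph on part indices: i ~ j iff (S_i, S_j) is an adjacent pair."""
    parts = [set(p) for p in parts]
    q = {i: set() for i in range(len(parts))}
    for i, Si in enumerate(parts):
        for j in range(i + 1, len(parts)):
            Sj = parts[j]
            # Adjacent pair: every vertex of S_i is adjacent to all of S_j.
            if all(Sj <= adj[u] for u in Si):
                q[i].add(j)
                q[j].add(i)
    return q


def part_measure(parts):
    """Measure of each part: |S_i| divided by the total size of all parts."""
    total = sum(len(p) for p in parts)
    return [Fraction(len(p), total) for p in parts]
```

When all parts are modules, any two distinct parts are either fully adjacent or fully non-adjacent, so a homogeneous pair of the quotient graph lifts to a homogeneous pair of the original graph whose sizes are given by the measure.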

To prove the theorem, we must show that there exists a constant δ > 0 such that every G ∈ 𝒢 contains
a homogeneous pair (P, Q) with |P|, |Q| ≥ δ · v(G). Fix ε = ε_0/40000, set δ_0 = δ(ε, H_k) where δ(·, ·) is the
constant from Theorem 4.1 and set δ = δ_0 ε_0/10^7. Thus since both G and G^c are H_k-free, Theorem 4.1
implies that either G or G^c contains an induced subgraph, say G_0, on at least δ_0 · v(G) vertices with edge
density at most ε. Assume without loss of generality that G_0 is an induced subgraph of G. By a simple
averaging argument, G_0 contains an induced subgraph G_1, with v(G_1) ≥ v(G_0)/2 ≥ δ_0 · v(G)/2 and where
G_1 has maximum degree at most 4ε · v(G_1).

By the claim, either G_1 (and hence G) has a homogeneous pair of size at least ε_0 · v(G_1)/3000 ≥ δ · v(G)
(and we are done) or G_1 has an active ℓ-hook (X, R) for some ℓ ≥ k with |R| ≥ ε_0 · v(G_1)/50. That is, G[X]
is isomorphic to an ℓ-hook, say with active vertex x, and N(R) ∩ X = {x}. Set R' := R \ N(x). Note that
|R'| ≥ |R| − Δ(G_1) ≥ ε_0 · v(G_1)/100.

We can now apply the claim to G_2 = G[R'] since Δ(G_2) ≤ Δ(G_1) ≤ 4ε · v(G_1) ≤ ε_0|R'|/100. Thus either
G_2 has a homogeneous pair of size at least ε_0|R'|/3000 ≥ δ · v(G) (and we are done) or G_2 has an active ℓ'-hook
(X*, R*) for some ℓ' ≥ k ≥ 2. Let P_x be the shortest path in the graph G[R] between the vertex x (the active
vertex of the active ℓ-hook (X, R)) and the set X*, and suppose it meets X* at the vertex y. Such a path
certainly exists since N(R) ∩ X = {x} and R is connected, and it must be an induced path (since it is a shortest
path). Now, one easily sees that X ∪ V(P_x) ∪ X* contains an induced copy of a double ℓ''-hook, for some ℓ'' ≥ k
(no matter where y is in X*!). (To see that this copy is induced, note that by definition, there are no edges
between X and R other than those between x and N_R(x).) But this is a contradiction to having G ∈ 𝒢. □
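
The "simple averaging argument" used in the proof of Theorem 1.4 can be made explicit: if a graph on n vertices has edge density at most ε, then fewer than n/2 of its vertices have degree above 2ε(n − 1), and deleting them leaves at least n/2 vertices, each of degree at most 2ε(n − 1) ≤ 4ε times the number of remaining vertices. A minimal sketch, assuming a dictionary-of-sets graph and a hypothetical function name:

```python
def low_degree_subgraph(adj, eps):
    """Drop every vertex of degree above 2 * eps * (n - 1).

    If the edge density of `adj` is at most eps, at least half of the
    vertices survive (Markov's inequality on the degree sequence).
    """
    n = len(adj)
    keep = {v for v, nb in adj.items() if len(nb) <= 2 * eps * (n - 1)}
    # Induced subgraph on the surviving vertices.
    return {v: adj[v] & keep for v in keep}
```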


5 Warm up: Proof of Theorem 4.4 


The aim of this section is to provide a proof of Theorem 4.4 that will serve as a warm-up before proving
the main technical step of this paper, namely Theorem 4.3. The proofs of Theorems 4.4 and 4.3 follow the
same general outline, while the technical details in this section are much simpler. To exhibit the similarities
between the proofs, we use nearly the same subsection structure in this section and the next one, even though
here some subsections will consist only of a single simple observation.

Let us fix a 1/10-structured pair (G, (A, B, S)) such that G is claw-free and C_5-free. Define a binary
relation R_= on S as R_=(x, y) if and only if N_{A∪B}(x) = N_{A∪B}(y), and note that this is an equivalence
relation. Our approach consists of the following steps:


1. We start with filtering out vertices x ∈ S that have large neighbourhood in A or in B. Since every
vertex p ∈ A ∪ B satisfies |N_S(p)| ≤ |S|/10, a standard averaging argument shows that the number of
such vertices is small.


2. Second, for every x ∈ S, we study nonedges inside neighbourhoods N_A(x); such nonedges turn out to
be good starting points to construct either a claw (in the case of Theorem 4.4) or a hook (in the case
of Theorem 4.3).


3. Then, for every two vertices x, y ∈ S, we investigate how the neighbourhoods N_A(x) and N_A(y) differ,
depending on whether xy is an edge or a nonedge. Intuitively, we want to prove that if xy ∈ E(G),
then the neighbourhoods in A and B cannot change much, while if xy ∉ E(G), then they should change
much or not at all.


4. We then collect the main properties we need from the aforementioned steps in the definition of a nice
ε-structured pair. We prove that the relevant ε-structured pair is nice, both in the proof of Theorem 4.4
and Theorem 4.3. In this section we show that it is sufficient for the relevant ε-structured pair to be
nice in order to find a large set Ŝ ⊆ S, such that if we restrict the relation R_= to Ŝ, the equivalence
classes of this relation form a decomposition of G[Ŝ] into modules.


5. Finally, we show that in the case of Theorem 4.4, the quotient graph of the aforementioned decomposition
is diamond-free; this, together with being claw-free, implies that the quotient graph is in fact
a line graph of a triangle-free graph, concluding the proof of Theorem 4.4.


In the proofs of Theorems 4.4 and 4.3, if a statement is accompanied with a sign (A ↔ B), then we also
claim that the same statement holds with the roles of A and B swapped.


5.1 Filtering step 

Let S_A = {x ∈ S : A ⊆ N(x)} and similarly define S_B. A standard averaging argument shows the following:

Claim 5.1. |S_A|, |S_B| ≤ |S|/10.

Proof. Consider the following random experiment: independently and uniformly at random pick a vertex
p ∈ A and x ∈ S. Since every vertex in A is adjacent to at most |S|/10 vertices of S, the probability that
px ∈ E(G) is at most 1/10. On the other hand, once x ∈ S_A, we have px ∈ E(G) regardless of the choice
of p. Consequently, the probability that x ∈ S_A is at most 1/10. □
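
The averaging argument can be written out as a short double-counting computation. In the following sketch, p and x are the independent uniform choices from the proof and N_S(p) denotes the neighbourhood of p in S; the only inputs are the bound |N_S(p)| ≤ |S|/10 for p ∈ A and the definition of S_A.

```latex
\begin{align*}
\Pr[px \in E(G)]
  &= \frac{1}{|A|\,|S|} \sum_{p \in A} |N_S(p)|
   \le \frac{1}{|A|\,|S|} \cdot |A| \cdot \frac{|S|}{10}
   = \frac{1}{10}, \\
\Pr[px \in E(G)]
  &\ge \Pr[x \in S_A] \cdot \Pr[px \in E(G) \mid x \in S_A]
   = \frac{|S_A|}{|S|} \cdot 1 .
\end{align*}
```

Combining the two lines gives |S_A| ≤ |S|/10, and the bound for S_B follows by exchanging the roles of A and B.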


Define now S' = S \ (S_A ∪ S_B) and G' = G \ (S_A ∪ S_B). Since |S_A ∪ S_B| ≤ |S|/5, we have that
(G', (A, B, S')) is a 1/8-structured pair.

By restricting ourselves to the structured pair (G', (A, B, S')), it suffices to prove the conclusion of
Theorem 4.4 with stronger condition |Ŝ| ≥ |S|/4, but with the additional assumption

∀ x ∈ S : (N_A(x) ⊊ A) ∧ (N_B(x) ⊊ B). (5.1)

To simplify the notation, in the rest of this section we assume that the input structured graph is only 
1/8-structured, but satisfies already (5.1). 


5.2 Neighbourhoods in A ∪ B

5.2.1 Nonedges inside a neighbourhood in A 

In the case of claw-free graphs, there are simply no nonedges inside neighbourhoods in A.

Claim 5.2 (A ↔ B). For every x ∈ S the set N_A(x) is a clique.

Proof. Assume the contrary, let p, q ∈ N_A(x), p ≠ q, and pq ∉ E(G). Let z ∈ N_B(x) be any vertex (it exists
since N(B) = S). Then (x; p, q, z) is a claw in G, a contradiction. □


5.2.2 Neighbourhoods along a nonedge in S 

Claim 5.3 (A ↔ B). For every x, y ∈ S with x ≠ y, xy ∉ E(G), there is no edge between N_A(x) ∩ N_A(y)
and A \ N_A(x, y).

Proof. Assume the contrary, let p ∈ N_A(x) ∩ N_A(y) and q ∈ A \ N_A(x, y) with pq ∈ E(G). Then (p; x, y, q)
is a claw in G, a contradiction. □


5.2.3 Neighbourhoods along an edge in S 

Claim 5.4. For every xy ∈ E(G[S]), either N_A(x) \ N_A(y) or N_B(x) \ N_B(y) is empty.

Proof. Assume the contrary, let p_Γ ∈ N_Γ(x) \ N_Γ(y) for Γ ∈ {A, B}. Then (x; y, p_A, p_B) is a claw in G, a
contradiction. □

5.3 Niceness of an ε-structure and its corollaries


In the following definition, we extract some properties of the ε-structured pair (G, (A, B, S)) that were proven
in Claims 5.2, 5.3, and 5.4, and then show what can be deduced from these properties only. Exactly the
same properties will be proven in the next section, in the more general setting of Theorem 4.3, and hence
we will be able to reuse the statements obtained here.


Definition 5.5. An ε-structured pair (G, (A, B, S)) is called nice if the following holds:

(NE1) for every x ∈ S we have A ⊈ N(x) and B ⊈ N(x);

(NE2) (A ↔ B) for every x, y ∈ S with x ≠ y and xy ∉ E(G), if N_B(x) ≠ N_B(y), then there is no edge
between N_A(x) ∩ N_A(y) and A \ N_A(x, y);

(NE3) for every x, y ∈ S with x ≠ y, xy ∉ E(G), and N_A(x) ⊊ N_A(y), the sets N_A(x) and N_A(y) \ N_A(x)
are fully adjacent;

(E1) for every x, y ∈ S such that xy ∈ E(G[S]), either N_A(x) \ N_A(y) = ∅ or N_B(x) \ N_B(y) = ∅.

Note that we have used here the notation (A ↔ B), denoting that the particular condition is required
to hold also with the roles of A and B swapped. Whenever ε is unimportant for the analysis we shall drop
it from the notation and speak only of a (nice) structured pair.

Let us now formally verify that the considered structured pair (G, (A, B, S)) is nice.


Claim 5.6. The structured pair (G, (A, B, S)) is nice.


Proof. Property (NE1) is equivalent to (5.1), Property (NE2) is strictly weaker than the statement of
Claim 5.3, Property (NE3) is a special case of the statement of Claim 5.2, while Property (E1) is exactly
the statement of Claim 5.4. □


We start our analysis of nice structured graphs with the following observation. 

Lemma 5.7. If a structured pair (G, (A, B, S)) satisfies Properties (NE1) and (NE2), then for every two
distinct vertices x, y ∈ S with xy ∉ E(G) we have N_A(x) = N_A(y) if and only if N_B(x) = N_B(y).

Proof. Assume by contradiction that for some x, y ∈ S with x ≠ y and xy ∉ E(G) we have N_A(x) = N_A(y)
but N_B(x) ≠ N_B(y). By Property (NE2), there is no edge between N_A(x) ∩ N_A(y) = N_A(x) and
A \ N_A(x, y) = A \ N_A(x). However, by Property (NE1) and the assumption N(A) = S, both N_A(x)
and A \ N_A(x) are nonempty. This contradicts the connectivity of G[A]. □

We now move to a deeper study of the situation treated in Property (NE3). 

Lemma 5.8 (A ↔ B). If (G, (A, B, S)) is a nice structured pair, then there do not exist three distinct
vertices x, y, z ∈ S with xy, yz ∉ E(G) and N_A(x) ⊊ N_A(y) ⊊ N_A(z).

Proof. Assume the contrary, and let x, y, z be as in the statement. Let p be any vertex of N_A(x) and q be any
vertex of N_A(z) \ N_A(y). By Lemma 5.7 applied to the pair (x, y), we have N_B(x) ≠ N_B(y) since N_A(x) ≠
N_A(y). By Property (NE2) applied to the pair (x, y), we have pq ∉ E(G), since p ∈ N_A(x) = N_A(x) ∩ N_A(y)
and q ∈ N_A(z) \ N_A(y) ⊆ A \ N_A(x, y). However, Property (NE3) applied to the pair (y, z) implies that
pq ∈ E(G), a contradiction. □

Recall that we have defined the relation R_= on the set S as R_=(x, y) if and only if N_{A∪B}(x) = N_{A∪B}(y).
We now introduce a number of other binary relations on the set S that describe the relation between
neighbourhoods in A ∪ B. For two vertices x, y ∈ S we have

R_≠(x, y) if and only if N_A(x) and N_A(y) are incomparable with respect to inclusion, and N_B(x) and N_B(y)
are incomparable with respect to inclusion;

R_A(x, y) if and only if N_A(x) = N_A(y) and N_B(x) ≠ N_B(y);

R_B(x, y) if and only if N_B(x) = N_B(y) and N_A(x) ≠ N_A(y);

R_⊂(x, y) if and only if N_A(x) ⊊ N_A(y) and N_B(x) ⊋ N_B(y);

R_⊃(x, y) if and only if N_A(x) ⊋ N_A(y) and N_B(x) ⊊ N_B(y).

Observe that the relations R_=, R_≠, R_A, and R_B are symmetric, while R_⊂ and R_⊃ are strongly antisymmetric,
and R_⊂(x, y) if and only if R_⊃(y, x). Furthermore, all six defined relations are pairwise disjoint.
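
The six relations depend only on the four neighbourhoods involved, so they can be computed by a small case analysis. The sketch below is an illustration with a hypothetical function name and descriptive string labels in place of the relation symbols; note that the six relations do not cover every configuration (for instance, both neighbourhoods of x may be strictly contained in those of y), in which case `None` is returned.

```python
def classify_pair(NA_x, NB_x, NA_y, NB_y):
    """Name the relation between x and y determined by their neighbourhoods.

    NA_* and NB_* are Python sets: the neighbourhoods in A and in B.
    Returns one of "equal", "A", "B", "subset", "superset", "incomparable",
    or None if none of the six relations holds.
    """
    if NA_x == NA_y and NB_x == NB_y:
        return "equal"          # same neighbourhood in A ∪ B
    if NA_x == NA_y:
        return "A"              # agree on A, differ on B
    if NB_x == NB_y:
        return "B"              # agree on B, differ on A
    # From here on, both pairs of neighbourhoods differ.
    a_lt, a_gt = NA_x < NA_y, NA_x > NA_y   # proper subset / superset
    b_lt, b_gt = NB_x < NB_y, NB_x > NB_y
    if a_lt and b_gt:
        return "subset"         # grows in A, shrinks in B
    if a_gt and b_lt:
        return "superset"       # shrinks in A, grows in B
    if not (a_lt or a_gt) and not (b_lt or b_gt):
        return "incomparable"   # incomparable in both A and B
    return None
```

Because the branches are mutually exclusive, the function also makes the pairwise disjointness of the six relations visible.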

Lemma 5.8 implies that, along nonedges in S, the neighbourhoods in A cannot create chains with respect
to inclusions. As a corollary, we can obtain the following:

Lemma 5.9. If (G, (A, B, S)) is a nice structured pair, then there exists a set Ŝ ⊆ S of size at least |S|/4
such that for every x, y ∈ Ŝ with x ≠ y and xy ∉ E(G), either R_=(x, y) or R_≠(x, y).


Proof. Consider an auxiliary directed multigraph G_A defined as follows: we take V(G_A) = S and for every
x, y ∈ S with x ≠ y and xy ∉ E(G) we add an arc (x, y) if N_A(x) ⊊ N_A(y). Let S_A^+ be the set of vertices
of S that have positive out-degree in G_A, and let S_A^- be the set of vertices of S that have positive in-degree.
Symmetrically, define G_B and sets S_B^+ and S_B^-. For α, β ∈ {+, −}, define S^{αβ} = S \ (S_A^α ∪ S_B^β).

Lemma 5.8 implies that S_A^+ ∩ S_A^- = ∅ and S_B^+ ∩ S_B^- = ∅, which in turn implies that
S^{++} ∪ S^{+−} ∪ S^{−+} ∪ S^{−−} = S. Consequently, by setting Ŝ to be the largest of the sets S^{αβ}, we have
|Ŝ| ≥ |S|/4. The definition of the sets S^{αβ} ensures that G_A[Ŝ] and G_B[Ŝ] are arcless, that is, for every
x, y ∈ Ŝ with xy ∉ E(G) it cannot happen that N_A(x) ⊊ N_A(y) or N_B(x) ⊊ N_B(y). However, Lemma 5.7
ensures that once N_{A∪B}(x) ≠ N_{A∪B}(y) for some x, y ∈ S with xy ∉ E(G), then both N_A(x) ≠ N_A(y) and
N_B(x) ≠ N_B(y), and, consequently, R_≠(x, y) if x, y ∈ Ŝ. This finishes the proof of the lemma. □
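
The selection of the large subset in the proof of Lemma 5.9 is constructive and can be sketched as follows. The function name `select_arcless_subset`, the dictionaries `NA` and `NB` (mapping each vertex of S to its neighbourhood in A and in B) and the predicate `adjacent` are assumptions made for this illustration; the quarter-size guarantee relies on the conclusion of Lemma 5.8, which the code does not verify.

```python
from itertools import product


def select_arcless_subset(S, NA, NB, adjacent):
    """Pick the largest of the four candidate sets from the proof of Lemma 5.9.

    S        -- iterable of vertices
    NA, NB   -- dicts: vertex -> set (neighbourhoods in A and in B)
    adjacent -- function (x, y) -> bool, adjacency inside S
    """
    S = set(S)

    def positive_degree_sets(N):
        # Arc (x, y) whenever xy is a nonedge and N[x] is a proper subset of N[y].
        out_pos, in_pos = set(), set()
        for x in S:
            for y in S:
                if x != y and not adjacent(x, y) and N[x] < N[y]:
                    out_pos.add(x)
                    in_pos.add(y)
        return {"+": out_pos, "-": in_pos}

    SA = positive_degree_sets(NA)
    SB = positive_degree_sets(NB)
    # One candidate per sign pattern: remove S_A^alpha and S_B^beta from S.
    candidates = [S - (SA[a] | SB[b]) for a, b in product("+-", repeat=2)]
    return max(candidates, key=len)
```

Each candidate omits either all tails or all heads of the arcs on each side, so no arc of either auxiliary digraph survives inside the returned set.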


Summarizing, we obtain the following statement, which says that for every distinct x, y ∈ Ŝ, the existence
or non-existence of an edge xy can be determined by examining the neighbourhoods of x and y in A ∪ B.

Theorem 5.10. For every nice structured pair (G, (A, B, S)) there exists a set Ŝ ⊆ S with |Ŝ| ≥ |S|/4 such
that for every x, y ∈ Ŝ with N_{A∪B}(x) ≠ N_{A∪B}(y) the following holds.


1. xy ∈ E(G) if and only if exactly one of the following holds: R_A(x, y), R_B(x, y), R_⊂(x, y), or R_⊃(x, y).

2. xy ∉ E(G) if and only if R_≠(x, y).


Proof. We obtain the set Ŝ from Lemma 5.9. The “if” part of the assertion for edges xy and the “only if”
part of the assertion for nonedges xy is straightforward from Lemma 5.9, while the remaining two implications
follow from Property (E1). □


As mentioned at the beginning of this section, we now partition Ŝ according to the relation R_=. Observe
that due to Theorem 5.10, the presence or absence of an edge between two vertices x, y ∈ Ŝ is determined
by N_{A∪B}(x) and N_{A∪B}(y) unless R_=(x, y). An immediate corollary is the following.


Corollary 5.11. Let (G, (A, B, S)) be a nice structured pair, let Ŝ ⊆ S be the set obtained from Theorem 5.10
and let S_1, S_2, ..., S_r be the equivalence classes of the relation R_= restricted to Ŝ. Then every set S_i is a
module of G[Ŝ].


As a last step in our analysis of nice structured graphs, we investigate P_3's in the quotient graph of the
aforementioned partition of G[Ŝ] into modules.


Lemma 5.12. Let (G, (A, B, S)) be a nice structured pair, let Ŝ ⊆ S be the set obtained from Theorem 5.10
and let x, y, z ∈ Ŝ be three distinct vertices belonging to different equivalence classes of the relation R_=, such
that xy ∈ E(G), yz ∈ E(G), and xz ∉ E(G). Then one of the following holds:


• R_A(x, y) and R_B(y, z);

• R_B(x, y) and R_A(y, z);

• R_⊂(x, y) and R_⊂(z, y);

• R_⊃(x, y) and R_⊃(z, y).

Proof. Since xz ∉ E(G), we have R_≠(x, z); in particular the sets N_A(x) and N_A(z) are incomparable with
respect to inclusion. If R_A(x, y), then the only option from Theorem 5.10 for the edge yz that allows this
property to happen is R_B(y, z); symmetrical claims follow if we swap the roles of A and B and/or the roles
of x and z. In the remaining case, if neither (x, y) nor (y, z) belongs to R_A ∪ R_B, then the only way to ensure
incomparability of N_A(x) and N_A(z) is to have R_⊂(x, y) and R_⊂(z, y) or R_⊃(x, y) and R_⊃(z, y). □


In the next lemma we remark that Lemma 5.12 already implies that the quotient graph of the partition
of G[Ŝ] into equivalence classes of the relation R_= is Berge.


Lemma 5.13. Let (G, (A, B, S)) be a nice structured pair and let Ŝ ⊆ S be the set obtained from Theorem 5.10.
Then the quotient graph of the partition of G[Ŝ] into equivalence classes of the relation R_= is Berge.

Proof. Assume that the set Ŝ contains a sequence x_1, x_2, ..., x_h of vertices for some odd integer h ≥ 5, such
that x_i x_{i+1} ∈ E(G) and x_i x_{i+2} ∉ E(G) for every 1 ≤ i ≤ h and for indices behaving cyclically modulo h.
Furthermore, assume that no two vertices x_i are in relation R_=.

Consider the edge x_1x_2, and let us consider four cases, depending on which option of Theorem 5.10 holds
for this edge. By symmetry between the sides A and B, we need only consider the cases R_A(x_1, x_2) and
R_⊂(x_1, x_2). If R_A(x_1, x_2), then Lemma 5.12 applied to the P_3 x_1, x_2, x_3 implies that R_B(x_2, x_3). Inductively,
we infer that R_A(x_i, x_{i+1}) if i is odd and R_B(x_i, x_{i+1}) if i is even. However, this leads to a contradiction
as h is odd. A similar situation happens if R_⊂(x_1, x_2): we have R_⊂(x_i, x_{i+1}) for odd i and R_⊃(x_i, x_{i+1}) for
even i, again yielding a contradiction.

We infer that no such sequence x_1, x_2, ..., x_h exists. However, note that such a sequence is present in any
odd hole in the quotient graph in question (take the subsequent vertices on the hole) and is present in any
odd anti-hole as well (if the anti-hole consists of h vertices y_1, y_2, ..., y_h in this order, take x_i = y_{(i⌊h/2⌋) mod h}).
We infer that the quotient graph in question does not contain any odd hole nor anti-hole, and is thus
Berge. □


Let us now wrap up what our analysis of nice structured graphs implies for the proof of Theorem 4.4.
Recall that we are dealing with a structured pair (G, (A, B, S)) where G is claw-free and C_5-free. Claim 5.6
implies that (after the filtering step) we are in fact dealing with a nice structured pair. Theorem 5.10
provides us with a candidate set Ŝ, that we fix for the remainder of this proof. Corollary 5.11 implies that
the relation R_= partitions G[Ŝ] into modules. Moreover, by construction, every such module S_i is contained
in a neighbourhood of some vertex from A. It remains to analyse the quotient graph of this partition.


5.4 The quotient graph: Excluding a diamond 


Clearly, the quotient graph of the partition of G[Ŝ] into equivalence classes of the relation R_= is claw-free,
since G is claw-free. In the rest of this section we show that it is also diamond-free. This, together with a
characterization from [ 2 E; CH] showing that the class of (claw, diamond)-free graphs is exactly the class of
line graphs of triangle-free graphs, concludes the proof of Theorem 4.4.


We start by showing that the last two cases of Lemma 5.12 cannot appear if G is claw-free and C_5-free.


Claim 5.14. Let x, y, z be as in the statement of Lemma 5.12. Then either R_A(x, y) and R_B(y, z) or R_B(x, y)
and R_A(y, z). That is, the last two cases cannot happen.


Proof. Assume the contrary; by swapping the sides A and B if needed, we can assume that R_⊂(x, y) and
R_⊂(z, y). Since xz ∉ E(G), the sets N_A(x) and N_A(z) are incomparable with respect to inclusion; let
p ∈ N_A(x) \ N_A(z) and q ∈ N_A(z) \ N_A(x). By Claim 5.2 we have pq ∈ E(G), since p, q ∈ N_A(y). Let
s ∈ N_B(y) be any vertex. Observe that {p, x, s, z, q} induce a C_5 in G, a contradiction. □

We conclude with an observation that without the two cases of Lemma 5.12 excluded in Claim 5.14 we
cannot have a diamond in the quotient graph.


Claim 5.15. The quotient graph of the partition of G[Ŝ] into equivalence classes of the relation R_= is
diamond-free.

Proof. Assume the contrary. Let x, y, s, t ∈ Ŝ be four distinct vertices that belong to four different equivalence
classes of the relation R_=. Furthermore, assume that G[{x, y, s, t}] is isomorphic to a diamond with xy ∉ E(G).
By swapping the sides A and B if needed, by Claim 5.14 applied to the triple x, s, y, we can assume that
R_A(x, s) and R_B(y, s).

Let us now consider two cases of Claim 5.14 applied to the triple x, t, y. If R_A(x, t) and R_B(y, t), then
we have N_A(s) = N_A(x) = N_A(t) and N_B(s) = N_B(y) = N_B(t), giving R_=(s, t), a contradiction. If R_B(x, t)
and R_A(y, t), then we have

N_A(s) = N_A(x), N_A(t) = N_A(y), N_B(s) = N_B(y), N_B(t) = N_B(x). (5.2)

Since xy ∉ E(G), we have by Theorem 5.10 that R_≠(x, y). By (5.2) this implies that R_≠(s, t), a contradiction
to the assumption st ∈ E(G) and Theorem 5.10. □


6 Proof of Theorem 4.3


In this section we prove Theorem 4.3 using the same proof outline as for Theorem 4.4 from the previous
section. In particular, after a filtering step we will prove that the ε-structured pair at hand is actually nice
(c.f. Definition 5.5), which allows us to apply the tools developed in Section 5.3.


It will be convenient for the proof to split the constant ε into three constants ε, δ, and γ in the following
way. We show that, for every k > 0, if ε, δ, γ are small enough positive constants that satisfy 2ε ≤ γ then
in every ε-structured pair (G, (A, B, S)) there exists either


1. an anti-adjacent pair (P, Q) in G with P, Q ⊆ S, |P|, |Q| ≥ δ|S|; or

2. an active ℓ-hook (X, R) in G with ℓ ≥ k, R ⊆ S, and |R| ≥ γ|S|; or

3. a subset Ŝ ⊆ S with |Ŝ| ≥ |S|/5 and a partition Ŝ = S_1 ⊎ S_2 ⊎ ... ⊎ S_m, for some m ≥ 2, such that

(a) |S_i| ≤ ε|S| for every 1 ≤ i ≤ m;

(b) every set S_i is a module of G[Ŝ]; and

(c) the quotient graph of this partition of the vertex set of G[Ŝ] is a claw-free Berge graph.


Instead of giving an explicit formula for ε, δ, and γ, we will state a number of inequalities that these constants
should satisfy in the course of the proof. Every such inequality will be true for sufficiently small positive
constants; in particular, taking ε = δ = 1/(200(k+10)) and γ = 1/(100(k+10)) suffices.

For two disjoint vertex sets Q and D in a graph G, reach(Q → D) denotes the set of vertices v of D such
that in the graph G[Q ∪ D] there is a path from some vertex in Q to v. Equivalently reach(Q → D) = C ∩ D,
where C is the union of all components of G[Q ∪ D] that contain at least one vertex of Q.
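
The set reach(Q → D) is exactly what a breadth-first search from Q restricted to Q ∪ D computes. A minimal sketch, assuming a dictionary-of-sets graph; the function name `reach` mirrors the notation but is otherwise an illustration:

```python
from collections import deque


def reach(adj, Q, D):
    """Vertices of D reachable from Q by paths inside the graph G[Q ∪ D]."""
    Q, D = set(Q), set(D)
    allowed = Q | D
    seen = set(Q)
    queue = deque(Q)
    while queue:
        v = queue.popleft()
        # Only follow edges that stay inside Q ∪ D.
        for w in adj[v] & allowed:
            if w not in seen:
                seen.add(w)
                queue.append(w)
    return seen & D
```

Vertices of D that are connected to Q only through vertices outside Q ∪ D are not reported, in line with the definition.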

Let (G, (A, B, S)) be an ε-structured graph for some (small) constant ε > 0.


6.1 Filtering 

In the proof of Theorem 4.3 we need a stronger filtering step than the one used for Theorem 4.4: we need not
only to discard vertices of S that are adjacent to the entire set A or B, but all vertices that are adjacent to a
large fraction of A or B. Furthermore, we need to use a non-uniform measure on A and B, as defined below.

For every x ∈ S, we fix one neighbour π_A(x) ∈ N_A(x) and one neighbour π_B(x) ∈ N_B(x). We define a
probability measure μ_A on A by μ_A(X) = |π_A^{-1}(X)|/|S|. That is, the measure μ_A corresponds to a random
experiment where we choose a vertex x ∈ S uniformly at random, and output π_A(x). Similarly we define
a probability measure μ_B on B using the function π_B.
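
The measure on A is the push-forward of the uniform distribution on S under the chosen neighbour map. The following sketch computes it with exact rational arithmetic; the function names and the representation of the neighbour map as a dictionary `pi` (vertex of S to its fixed neighbour) are assumptions made for illustration.

```python
from collections import Counter
from fractions import Fraction


def pushforward_measure(pi):
    """Measure on the target side: mass of p is |{x : pi[x] == p}| / |S|.

    pi -- dict mapping every vertex x of S to its fixed neighbour pi[x]
    """
    counts = Counter(pi.values())
    total = len(pi)
    return {p: Fraction(c, total) for p, c in counts.items()}


def measure(mu, X):
    """Total mass of a set X; vertices never chosen by pi have mass zero."""
    return sum((mu.get(p, Fraction(0)) for p in X), Fraction(0))
```

Vertices of A that are never selected as a fixed neighbour receive mass zero, which is what makes the measure non-uniform.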

Let $S_A = \{x \in S : \mu_A(N_A(x)) \ge 10\varepsilon\}$ and similarly let $S_B = \{x \in S : \mu_B(N_B(x)) \ge 10\varepsilon\}$. A standard averaging argument shows the following.

Claim 6.1. $|S_A|, |S_B| \le |S|/10$.

Proof. Consider the following random experiment: independently choose $x \in S$ uniformly at random and $p \in A$ according to the measure $\mu_A$. Since every vertex in $A$ is adjacent to at most $\varepsilon|S|$ vertices of $S$, the probability that $px \in E(G)$ is at most $\varepsilon$. On the other hand, conditioning on $x \in S_A$, we have $px \in E(G)$ with probability at least $10\varepsilon$ by the definition of $S_A$. Consequently, the probability that $x \in S_A$ is at most $1/10$. The proof for $S_B$ is symmetric. □
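To make the measure $\mu_A$ and the set $S_A$ concrete, here is a small sketch on a hypothetical instance. The instance, the vertex labels and the helper names (`mu_A`, `heavy_vertices`) are assumptions made for illustration; only the formulas $\mu_A(X) = |\pi_A^{-1}(X)|/|S|$ and the threshold $10\varepsilon$ come from the text.

```python
from fractions import Fraction


def mu_A(pi_A, S, X):
    """mu_A(X) = |pi_A^{-1}(X)| / |S| for a set X of vertices of A."""
    return Fraction(sum(1 for x in S if pi_A[x] in X), len(S))


def heavy_vertices(N_A, pi_A, S, eps):
    """S_A: vertices x of S whose A-neighbourhood has measure >= 10 * eps."""
    return {x for x in S if mu_A(pi_A, S, N_A[x]) >= 10 * eps}


# Hypothetical instance with |S| = 40 and eps = 1/20, so every vertex of A
# may have at most eps * |S| = 2 neighbours in S.
S = list(range(40))
eps = Fraction(1, 20)
N_A = {x: set() for x in S}
for i in range(1, 20):
    # Vertex ("a", i) of A is adjacent to exactly the vertices 0 and i of S.
    N_A[0].add(("a", i))
    N_A[i].add(("a", i))
for i in range(20, 40):
    # Vertex ("a", i) of A is adjacent only to vertex i of S.
    N_A[i].add(("a", i))

# Fix one neighbour pi_A(x) in N_A(x) for every x (here: the smallest label).
pi_A = {x: min(N_A[x]) for x in S}
S_A = heavy_vertices(N_A, pi_A, S, eps)
```

In this instance only vertex 0 has a neighbourhood of measure at least $10\varepsilon = 1/2$, which is consistent with the bound $|S_A| \le |S|/10$ of Claim 6.1.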


By Claim 6.1 we have $|S_A \cup S_B| \le |S|/5$. Consequently, by considering the pair $(G \setminus (S_A \cup S_B), (A, B, S \setminus (S_A \cup S_B)))$ instead of $(G, (A,B,S))$, and by suitably adapting the constant $\varepsilon$, in the rest of the proof we can assume that our $\varepsilon$-structured pair $(G, (A,B,S))$ has the additional property that

$$\text{for all } x \in S: \quad \mu_A(N_A(x)) \le \varepsilon \ \text{ and } \ \mu_B(N_B(x)) \le \varepsilon. \qquad (6.1)$$


However, we now need to exhibit a set $S'$ of size at least $|S|/4$ (instead of $|S|/5$ in the statement of Theorem 4.3).

In the remainder of the proof, let us assume that, for some sufficiently small constants $\varepsilon$, $\delta$, and $\gamma$, our input structured graph $(G, (A,B,S))$ does not admit the desired anti-adjacent pair nor the desired active hook; our goal is to prove that $(G, (A,B,S))$ is nice and use the results of Section 5.3 to obtain the set $S'$. Observe that (6.1) already implies Property (NE1) for $(G, (A,B,S))$.


6.2 A generic claim to find an active hook 

We will encounter several situations that allow us to find an active hook in an $\varepsilon$-structured pair. We bundle the commonalities in the following claim.

Claim 6.2 ($A \leftrightarrow B$). Assume there exist pairwise disjoint sets $Z, Q, D \subseteq V(G)$ such that:

(i) $Q, D \subseteq A$;

(ii) $(Z,D)$ is an anti-adjacent pair;

(iii) for every $q \in Q$, there exists an integer $i \ge 0$ and an $i$-hook in $G[\{q\} \cup Z]$ with $q$ being the active vertex;

(iv) $(|Z| + k)\varepsilon + (k+3)\delta + \gamma < 1$.

Then $\mu_A(\mathrm{reach}(Q \to D)) \le |Z|\varepsilon + \delta$.

Proof. For a contradiction, assume that $\mu_A(\mathrm{reach}(Q \to D)) > |Z|\varepsilon + \delta$. Our goal is to construct an active $\ell$-hook $(X,R)$ with $\ell \ge k$, $R \subseteq S$, and $|R| \ge \gamma|S|$.

Let $S_0$ be the vertex set of the largest component of $G[S \setminus N_S[Z]]$, and let $M = S \setminus (S_0 \cup N_S[Z])$. Note that $|N_S[Z]| \le \varepsilon|Z||S|$ by Property 4 of an $\varepsilon$-structured pair, and so we have $|S_0 \cup M| \ge 3\delta|S|$ by assumption (iv). Thus, Lemma 2.1 implies that $|M| \le \delta|S|$. Hence, we have

$$|S \setminus S_0| \le (|Z|\varepsilon + \delta)|S| \qquad (6.2)$$

and, by assumption (iv),

$$|S_0| \ge (k\varepsilon + (k+2)\delta + \gamma)|S|. \qquad (6.3)$$

Since $\mu_A(\mathrm{reach}(Q \to D)) > |Z|\varepsilon + \delta$, and by the definition of $\mu_A$, there exists $x \in S_0$ with $\pi_A(x) \in \mathrm{reach}(Q \to D)$. In particular, there exists a path from $Q$ to $S_0$ with all internal vertices in $D$. Let $L$ be a shortest such path; note that it is possible that $L$ consists of a single edge, but $L$ contains at least two vertices since $Q \subseteq A$ and $S_0 \subseteq S$.

Let $q$ be the endpoint of $L$ in $Q$, $y$ be the second endpoint of $L$, and $x$ be the neighbour of $y$ on $L$ (it is possible that $x = q$). Using assumption (iii), we find an integer $i_0 \ge 0$ and an $i_0$-hook with vertex set $X \subseteq \{q\} \cup Z$ and active vertex $q$. We lengthen this hook with the path $L$: define $i := i_0 + |V(L)| - 2$, $X_i := X \cup (V(L) \cap D)$, and $R_i := S_0$. Observe that, since $Z$ and $D \cup S_0$ are fully anti-adjacent, and $L$ is a shortest path from $Q$ to $S_0$ via $D$, we have that $G[X_i]$ is an $i$-hook with $x$ being the active vertex, and $N(R_i) \cap X_i = \{x\}$. Consequently, $(X_i, R_i)$ is an active $i$-hook.


If $i \ge k$, then (6.3) ensures that $(X_i, R_i)$ is a desired active hook, a contradiction. Otherwise, we use the path-growing argument of [8] to turn it into an active $k$-hook, using the slack in (6.3) in the process. More formally, we build a sequence of active $j$-hooks $(X_j, R_j)$ for $j = i, i+1, \ldots, k$, with $X_i \subseteq X_{i+1} \subseteq \ldots \subseteq X_k$, $S_0 \supseteq R_i \supseteq R_{i+1} \supseteq \ldots \supseteq R_k$, and additionally maintain that

$$|R_j| \ge ((k-j)(\varepsilon + \delta) + 2\delta + \gamma)|S|. \qquad (6.4)$$

Clearly, (6.4) holds for $j = i$ (using (6.3) and the fact that $R_i = S_0$), while for $j = k$, (6.4) gives the desired lower bound on $|R_k|$ for the active $k$-hook $(X_k, R_k)$.

Assume that an active $j$-hook $(X_j, R_j)$ has been constructed for some $j < k$. Let $v_j \in X_j$ be the

that the choice of $v_{j+1}$, $X_{j+1}$, and $R_{j+1}$ ensures that $G[X_{j+1}]$ is a $(j+1)$-hook with active vertex $v_{j+1}$, and $N(R_{j+1}) \cap X_{j+1} = \{v_{j+1}\}$, finishing the description of the construction of $(X_{j+1}, R_{j+1})$. Hence, $(X_k, R_k)$ is an active $k$-hook with $R_k \subseteq S$ and $|R_k| \ge \gamma|S|$, a contradiction. This concludes the proof of the claim. □

In the remainder of the proof we assume that the constants $\varepsilon$, $\delta$, and $\gamma$ are sufficiently small such that

$$2\varepsilon + 3(6\varepsilon + \delta) < 1. \qquad (6.5)$$

In particular this means that assumption (iv) of Claim 6.2 is satisfied as long as $|Z| \le 6$. It also means that the bound in the conclusion of Claim 6.2 is small for $|Z| \le 6$; specifically, we can assume that any two neighbourhoods $N_A(x)$, $N_A(y)$ of vertices $x, y \in S$, together with any three sets $\mathrm{reach}(Q \to D) \subseteq A$ obtained from Claim 6.2 (applied with $|Z| \le 6$), cannot cover the entire set $A$.
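As a sanity check on the constants, the sketch below evaluates assumption (iv) of Claim 6.2 and inequality (6.5) with exact rational arithmetic. It assumes the values $\varepsilon = \delta = 1/(200(k+10))$ and $\gamma = 1/(100(k+10))$, which are our reading of the damaged formula earlier in this section and may differ from the authors' choice; the function names are hypothetical.

```python
from fractions import Fraction


def constants(k):
    """Assumed values of (eps, delta, gamma); a reading of a garbled formula."""
    eps = delta = Fraction(1, 200 * (k + 10))
    gamma = Fraction(1, 100 * (k + 10))
    return eps, delta, gamma


def assumption_iv(k, z_size):
    """Assumption (iv) of Claim 6.2: (|Z| + k) eps + (k + 3) delta + gamma < 1."""
    eps, delta, gamma = constants(k)
    return (z_size + k) * eps + (k + 3) * delta + gamma < 1


def inequality_6_5(k):
    """Inequality (6.5): 2 eps + 3 (6 eps + delta) < 1."""
    eps, delta, _ = constants(k)
    return 2 * eps + 3 * (6 * eps + delta) < 1


all_ok = all(
    assumption_iv(k, z) and inequality_6_5(k)
    for k in range(1, 100)
    for z in range(0, 7)
)
```

Under the assumed constants both inequalities hold with a lot of room for every $k$ tested and every $|Z| \le 6$.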


6.3 Neighbourhoods in A U B 

6.3.1 Non-edges inside an A-neighbourhood

We start with proving an analogue of Claim pj~2| 

Claim 6.3 ($A \leftrightarrow B$). For every $x \in S$ and $p, q \in N_A(x)$, if $pq \notin E(G)$ then $N_A(p) \setminus N_A(x) = N_A(q) \setminus N_A(x)$.

Proof. By contradiction, and using the symmetry between vertices $p$ and $q$, let us assume there exists



6.4 Neighbourhoods along a nonedge in S 

We start by proving Property (NE2). 

Claim 6.4 ($A \leftrightarrow B$). For every $x, y \in S$ with $x \ne y$ and $xy \notin E(G)$, if $N_B(x) \ne N_B(y)$, then there is no edge between $N_A(x) \cap N_A(y)$ and $A \setminus N_A(x,y)$.

Proof. Let $z \in N_B(x) \triangle N_B(y)$ be any vertex. Observe that the assumptions of Claim 6.2 are satisfied for $Z = \{x, y, z\}$, $Q = N_A(x) \cap N_A(y)$ and $D = A \setminus N_A(x,y)$: for every $q \in Q$ the graph $G[\{z, x, q, y\}]$ is a $P_4$ with $q$ being one of its internal vertices. Hence, $\mu_A(\mathrm{reach}(Q \to D)) \le 3\varepsilon + \delta$. Let us denote $F = \mathrm{reach}(Q \to D)$; our goal is to prove that $F = \emptyset$.

Assume the contrary, let $p \in F$ and $q \in Q$ with $pq \in E(G)$. Let $D' = D \setminus F$ and $Q' = N_A(D')$; note that, by the definition of $F$, we have $Q' \subseteq N_A(x) \triangle N_A(y)$. Furthermore, Claim 6.3 implies that $qq' \in E(G)$ for every $q' \in Q'$: $q'$ has a neighbour in $D'$, while $q$ does not have such a neighbour, and both $q$ and $q'$ belong either to $N_A(x)$ or to $N_A(y)$.

Let $z_x$ be any vertex in $N_B(x)$, $z_y$ be any vertex in $N_B(y)$, and $Z' = \{p, q, x, y, z_x, z_y\}$. We claim that the assumptions of Claim 6.2 are satisfied for $Z'$, $Q'$, and $D'$: clearly $Z'$ and $D'$ are fully anti-adjacent by construction, so it remains only to check assumption (iii).

To this end, consider $q' \in Q'$. By symmetry between $x$ and $y$, assume $q' \in N_A(x) \setminus N_A(y)$. If $pq' \in E(G)$, then $G[\{p, q', x, z_x\}]$ is a 0-hook with $q'$ being the active vertex. If $pq' \notin E(G)$, then $G[\{q, q', p, y, z_y\}]$ is a 1-hook with $q'$ being the active vertex.

By Claim 6.2, we infer that $\mu_A(\mathrm{reach}(Q' \to D')) \le 6\varepsilon + \delta$. However, by connectivity of $G[A]$ we have $\mathrm{reach}(Q' \to D') = D'$. This, together with $\mu_A(N_A(x,y)) \le 2\varepsilon$ by (6.1) and $\mu_A(F) \le 3\varepsilon + \delta$, contradicts (6.5). □


Since the structured pair $(G, (A,B,S))$ satisfies Properties (NE1) and (NE2), we can use Lemma 5.7 in the following, where we prove Property (NE3).

Claim 6.5 ($A \leftrightarrow B$). For every $x, y \in S$ with $x \ne y$ and $xy \notin E(G)$, if $N_A(x) \subseteq N_A(y)$, then the sets $N_A(x)$ and $N_A(y) \setminus N_A(x)$ are fully adjacent.

Proof. Since $N_A(x) \subseteq N_A(y)$, Lemma 5.7 implies that $N_B(x) \ne N_B(y)$. Consequently, Claim 6.4 asserts that $D := A \setminus N_A(y)$ and $N_A(x)$ are fully anti-adjacent. That is, if we define $Q = N_A(D)$, then $Q \subseteq N_A(y) \setminus N_A(x)$.

By contradiction, assume there exists $z \in N_A(y) \setminus N_A(x)$ and $p \in N_A(x)$ with $pz \notin E(G)$. Claim 6.3 implies that $z \notin Q$, as $p \notin Q$ and $p, z \in N_A(y)$. Furthermore, Claim 6.3 also implies that $z$ is fully adjacent to $Q$. We also know that $p$ is fully adjacent to $Q$. We infer that the conditions of Claim 6.2 are satisfied for $Z = \{z, p, x\}$ and the sets $Q$ and $D$: for every $q \in Q$, the graph $G[\{z, q, p, x\}]$ is a $P_4$ with $q$ being one of its internal vertices. Consequently, $\mu_A(\mathrm{reach}(Q \to D)) \le 3\varepsilon + \delta$, which stands in contradiction with the connectivity of $G[A]$ and (6.5). □


6.5 Neighbourhoods along an edge in S 

In the next three claims we prove Property (E1).

Claim 6.6 ($A \leftrightarrow B$). For every $xy \in E(G[S])$, if there is no edge between $N_A(x) \triangle N_A(y)$ and $A \setminus N_A(x,y)$, then $N_A(x) = N_A(y)$.

Proof. By contradiction, assume there exists $p \in N_A(x) \triangle N_A(y)$; by symmetry, assume $p \in N_A(x) \setminus N_A(y)$. Let $D = A \setminus N_A(x,y)$ and $Q = N_A(D) \subseteq N_A(x) \cap N_A(y)$. Let $z$ be any vertex in $N_B(y)$, and let $Z = \{p, y, z\}$. Observe that Claim 6.3 implies that $p$ is fully adjacent to $Q$, as they are both contained in $N_A(x)$ and $p$ does not have any neighbour in $D$. Consequently, the assumptions of Claim 6.2 are satisfied for the sets $Z$, $Q$, and $D$: for every $q \in Q$, the graph $G[\{p, q, y, z\}]$ is a $P_4$ with $q$ being one of the middle vertices. Hence, $\mu_A(\mathrm{reach}(Q \to D)) \le 3\varepsilon + \delta$. However, $\mathrm{reach}(Q \to D) = D$ by the connectivity of $G[A]$, and we have a contradiction with (6.5). □


Claim 6.7 ($A \leftrightarrow B$). For every $xy \in E(G[S])$, if $N_A(x) \setminus N_A(y) \ne \emptyset$ but $\mu_A(\mathrm{reach}(N_A(x) \setminus N_A(y) \to A \setminus N_A(x,y))) \le 6\varepsilon + \delta$, then $N_B(x) = N_B(y)$.

Proof. Let $F = \mathrm{reach}(N_A(x) \setminus N_A(y) \to A \setminus N_A(x,y))$, $D = A \setminus (N_A(x,y) \cup F)$, and $Q = N_A(D) \subseteq N_A(y)$. Let $p$ be any vertex in $N_A(x) \setminus N_A(y)$ and let $z$ be any vertex in $N_B(y)$.

If $N_B(y) \not\subseteq N_B(x)$, then let $z_1$ be any vertex of $N_B(y) \setminus N_B(x)$ and define $Z = \{x, y, z, p, z_1\}$. Otherwise, unless $N_B(x) = N_B(y)$, Claim 6.6 implies that there exists an edge $z_2z_3$ with $z_2 \in N_B(x) \setminus N_B(y)$ and $z_3 \in B \setminus N_B(x,y)$, and we take $Z = \{x, y, z, p, z_2, z_3\}$.

We claim that in both cases the sets $Z$, $Q$, and $D$ satisfy the assumptions of Claim 6.2. Clearly, $D$ and $Z$ are fully anti-adjacent, so it remains to check only assumption (iii). To this end, consider $q \in Q$. If $pq \in E(G)$, then $G[\{p, q, y, z\}]$ is a $P_4$ with $q$ being one of the middle vertices. Otherwise, Claim 6.3 implies that $q \notin N_A(x)$, that is, $q \in N_A(y) \setminus N_A(x)$. If the vertex $z_1$ exists, then $G[\{y, q, x, p, z_1\}]$ is a 1-hook with $q$ being the active vertex. Finally, if the edge $z_2z_3$ exists, then $G[\{x, y, q, p, z_2, z_3\}]$ is a 2-hook with $q$ being the active vertex.

We infer that $\mu_A(\mathrm{reach}(Q \to D)) \le 6\varepsilon + \delta$. However, the connectivity of $G[A]$ implies that $D = \mathrm{reach}(Q \to D)$. This is in contradiction with (6.5) and the assumption $\mu_A(F) \le 6\varepsilon + \delta$. □


Claim 6.8. For every $xy \in E(G[S])$, either $N_A(x) \setminus N_A(y) = \emptyset$ or $N_B(x) \setminus N_B(y) = \emptyset$.

Proof. Assume the contrary. Since $N_A(x) \ne N_A(y)$, Claim 6.7 applied to the side $B$ instead of the side $A$ asserts that $\mu_B(\mathrm{reach}(N_B(x) \setminus N_B(y) \to B \setminus N_B(x,y))) > 6\varepsilon + \delta$; in particular, there exists an edge $z_1z_2 \in E(G)$ with $z_1 \in N_B(x) \setminus N_B(y)$ and $z_2 \in B \setminus N_B(x,y)$.

Define now $Z = \{x, y, z_1, z_2\}$, $Q = N_A(x) \setminus N_A(y)$, and $D = A \setminus N_A(x,y)$. Observe that the assumptions of Claim 6.2 are satisfied for these sets: for every $q \in Q$ the graph $G[\{x, q, y, z_1, z_2\}]$ is a 1-hook with $q$ being its active vertex. Consequently, $\mu_A(\mathrm{reach}(Q \to D)) \le 4\varepsilon + \delta$, a contradiction to Claim 6.7 and the assumption $N_B(x) \ne N_B(y)$. □


6.6 Niceness and quotient graph 

Summing up, we have so far proven the following. 

Corollary 6.9. The $\varepsilon$-structure $(G, (A,B,S))$ is nice.

Let us define the relations on $S$, among them $R_=$, as in Section 5.3. We apply Theorem 5.10, obtaining a set $S' \subseteq S$ of size at least $|S|/4$; by Corollary 5.11 the equivalence classes of $R_=$ restricted to $S'$ partition $S'$ into modules of $G[S']$. Furthermore, Lemma 5.13 asserts that the quotient graph of this partition is Berge. Thus, to conclude the proof of Theorem 4.3, it suffices to show that the quotient graph of this partition is also claw-free.


6.7 Excluding a claw in the quotient graph 

Claim 6.10. The quotient graph of the partition of $G[S']$ into equivalence classes of the relation $R_=$ is claw-free.

Proof. By contradiction, assume there exists a claw $(t; x, y, z)$ in $G[S']$ such that no pair of vertices from $\{t, x, y, z\}$ are in relation $R_=$.

We apply Lemma 5.12 to three $P_3$s contained in the claw $(t; x, y, z)$. Observe that if one of the first two


outcomes happens for one of P3S, say R A {x,t) and R B (y,t ), then we have R A (z,t) by looking at the P3 on 
vertices y,t,z. Thus we obtain R^(x,z), a contradiction to the properties of S obtained from Theorem 5.10 


We infer that the only two possibilities are P£(x,t), R~(y,t), and R~(z,t), or the symmetrical option 
R~(x,t), R~(y,t), and R~(z,t). By swapping the sides A and B if needed, we may assume that the first 
option happens, that is, $N_A(x) \cup N_A(y) \cup N_A(z) \subseteq N_A(t)$ and $N_B(t) \subseteq N_B(x) \cap N_B(y) \cap N_B(z)$.

Let $D = A \setminus N_A(t)$, $Q_{xy} = N_A(t) \setminus (N_A(x) \triangle N_A(y))$, and similarly define $Q_{yz}$ and $Q_{xz}$. Since $xy \notin E(G)$,
by Theorem 5.10 we have R^(x,y) and there exists p £ N B (x) \ N B (y). Furthermore, observe that also 


$p \notin N_B(t)$. We infer that the sets $Z = \{t, x, y, p\}$, $Q_{xy}$, and $D$ satisfy the assumptions of Claim 6.2: for every $q \in N_A(x) \cap N_A(y)$ the graph $G[\{p, x, q, y\}]$ is a $P_4$ with $q$ being one of the middle vertices, while for every $q \in N_A(t) \setminus N_A(x,y)$ the graph $G[\{t, q, x, p, y\}]$ is a 1-hook with $q$ being its active vertex. Consequently, $\mu_A(\mathrm{reach}(Q_{xy} \to D)) \le 4\varepsilon + \delta$. Symmetrically, the same conclusion holds for $Q_{yz}$ and $Q_{xz}$.

Note now that $Q_{xy} \cup Q_{yz} \cup Q_{xz} = N_A(t)$, as $(X \triangle Y) \cap (Y \triangle Z) \cap (Z \triangle X) = \emptyset$ for any three sets $X$, $Y$, $Z$. Consequently, $\mu_A(\mathrm{reach}(N_A(t) \to D)) \le 3(4\varepsilon + \delta)$. However, $\mathrm{reach}(N_A(t) \to D) = D$ by connectivity of $G[A]$, and we have a contradiction with (6.5). This concludes the proof of the claim, and of Theorem 4.3. □
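The set identity used in the last step, that the three pairwise symmetric differences of $X$, $Y$, $Z$ have empty common intersection, depends only on the membership pattern of a single element. A brief sketch that checks all eight patterns (the function name is hypothetical):

```python
from itertools import product


def in_all_three_symmetric_differences(in_x, in_y, in_z):
    """True if an element with this membership pattern lies in
    (X sym-diff Y), (Y sym-diff Z) and (Z sym-diff X) simultaneously."""
    return (in_x != in_y) and (in_y != in_z) and (in_z != in_x)


# Enumerate all 2^3 membership patterns of one element in (X, Y, Z).
violations = [
    bits
    for bits in product([False, True], repeat=3)
    if in_all_three_symmetric_differences(*bits)
]
```

No pattern qualifies, since three truth values cannot be pairwise distinct; this is why every vertex of $N_A(t)$ lies in at least one of $Q_{xy}$, $Q_{yz}$, $Q_{xz}$.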
7 The strong Erdős-Hajnal property is much stronger

In this section, we prove Theorem 1.7. Both statements, (a) and (b), are implied by the following lemma.

Lemma 7.1. Let $k > 2$ be fixed and let $\mathcal{H}$ be a family of graphs such that

(P1) every $H \in \mathcal{H}$ contains a cycle of length at most $k$; or

(P2) for every $H \in \mathcal{H}$, the complement of $H$ contains a cycle of length at most $k$.

Then the class of $\mathcal{H}$-free graphs does not have the strong Erdős-Hajnal property.

Proof. Assume first that $\mathcal{H}$ is a family of graphs with Property (P1), i.e. every $H \in \mathcal{H}$ contains a cycle of length at most $k$. For every $\delta > 0$, we construct a graph $G_\delta$, say on $n$ vertices, that is $\mathcal{H}$-free and that does not contain a homogeneous pair $(P,Q)$ with $|P|, |Q| \ge \delta n$.

Fix $\delta > 0$, let $n$ be large enough, and let $G \sim G(n,p)$ be a random graph on $n$ vertices where every edge is present independently at random with probability

$$p = \frac{50}{\delta^2 n}.$$

Let $X_{\le k}$ be a random variable that counts the number of cycles of length at most $k$ in $G$, and for $3 \le \ell \le k$, let $X_\ell$ be a random variable that counts the number of cycles of length $\ell$ in $G$. By linearity of expectation we have

$$E(X_{\le k}) = \sum_{\ell=3}^{k} E(X_\ell) \le \sum_{\ell=3}^{k} n^\ell p^\ell = \sum_{\ell=3}^{k} \left(\frac{50}{\delta^2}\right)^{\ell} =: C.$$

Therefore, by Markov's Inequality,

$$\Pr(X_{\le k} \ge 3C) \le \frac{1}{3}. \qquad (7.1)$$


Let $Z_\delta$ be a random variable that counts the number of homogeneous pairs $(P,Q)$ in $G$ with $|P|, |Q| =$ .

Then

$$E(Z_\delta) \le 2^n \cdot 2^n \cdot (1-p)^{\delta^2 n^2/10} + 2^n \cdot 2^n \cdot p^{\delta^2 n^2/10},$$

where the first term is an upper bound on the expected number of anti-adjacent pairs $(P,Q)$ and the second term is an upper bound on the expected number of adjacent pairs $(P,Q)$. For $n$ large enough we have $p < 1/2$, so that we can deduce

$$E(Z_\delta) \le 2^{2n+1}(1-p)^{\delta^2 n^2/10} \le 2^{2n+1} e^{-p\delta^2 n^2/10},$$

where we use $1 - x \le e^{-x}$ in the last inequality. Therefore, by a standard first-moment argument and our choice of $p$,

$$\Pr(Z_\delta > 0) = \Pr(Z_\delta \ge 1) \le E(Z_\delta) \le e^{(2n+1)\ln(2) - p\delta^2 n^2/10} = e^{(2n+1)\ln(2) - 5n} = o(1).$$


Therefore, with probability at most $\frac{1}{3} + o(1)$, $G$ satisfies $X_{\le k} \ge 3C$ or $Z_\delta > 0$. That is, there exists a graph $G'$ that has at most $3C$ cycles of length at most $k$, and that has no homogeneous pair $(P,Q)$ of the size counted by $Z_\delta$. Remove a vertex from every cycle of length at most $k$ to obtain a graph $G_\delta$ on $n' \ge n/2$ vertices with no homogeneous pair $(P,Q)$ with $|P|, |Q| \ge \delta n'$. In particular, $G_\delta$ is $\mathcal{H}$-free, which proves the claim.
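The two quantities driving this argument can be evaluated numerically. The sketch below assumes the edge probability $p = 50/(\delta^2 n)$, which is our reading of the damaged display above; the function names and sample values are hypothetical. It computes the exponent $(2n+1)\ln 2 - p\delta^2 n^2/10$ of the first-moment bound and the constant $C = \sum_{\ell=3}^{k} (50/\delta^2)^\ell$ bounding the expected number of short cycles.

```python
import math


def first_moment_exponent(n, delta):
    """Exponent of the bound on Pr(Z > 0), assuming p = 50 / (delta^2 n)."""
    p = 50 / (delta ** 2 * n)
    return (2 * n + 1) * math.log(2) - p * delta ** 2 * n ** 2 / 10


def short_cycle_bound(k, delta):
    """C = sum over 3 <= l <= k of (n p)^l, which does not depend on n."""
    return sum((50 / delta ** 2) ** length for length in range(3, k + 1))


exponent = first_moment_exponent(n=1000, delta=0.1)
C = short_cycle_bound(k=3, delta=1.0)
```

With this choice of $p$ the exponent equals $(2n+1)\ln 2 - 5n$, which is negative and decreases linearly in $n$, while $C$ stays constant as $n$ grows, so deleting one vertex per short cycle removes only a bounded number of vertices.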

Assume now that the family $\mathcal{H}$ satisfies Property (P2). Then the family $\mathcal{H}^c := \{H^c : H \in \mathcal{H}\}$ satisfies Property (P1). So, by the first part, for every $\delta > 0$ we find a graph $G_\delta$, say on $n$ vertices, that is $\mathcal{H}^c$-free and has no homogeneous pair $(P,Q)$ with $|P|, |Q| \ge \delta n$. But then, the collection of graphs $G_\delta^c$ shows that the family $\mathcal{H}$ cannot have the strong Erdős-Hajnal property either. □

We are ready to prove Theorem 1.7.

Proof of Theorem 1.7. First, observe that since $P_4^c = P_4$, the class of $P_4$-free graphs has the strong Erdős-Hajnal property by the result of [5]. To prove the implication in the other direction, notice that if $H$ is not an induced subgraph of $P_4$ then either $H$ or $H^c$ contains a cycle. But then we can apply Lemma 7.1 to $\mathcal{H} = \{H\}$ and we are done. Thus, we proved statement (a). Statement (b) follows from Lemma 7.1 by taking $\mathcal{H} = \{H, H^c\}$. □
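The observation that a graph $H$ with both $H$ and $H^c$ acyclic must be an induced subgraph of $P_4$ can be confirmed by exhaustive search on small graphs. The sketch below is our own brute-force check, not the authors' argument, and all function names are hypothetical. It tests every labelled graph on at most five vertices; for more than four vertices nothing can qualify anyway, because two forests on $n$ vertices have at most $2(n-1)$ edges in total, fewer than $\binom{n}{2}$ once $n > 4$.

```python
from itertools import combinations, permutations


def is_forest(n, edges):
    """Union-find cycle test: True if the graph on 0..n-1 is acyclic."""
    parent = list(range(n))

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:
            return False
        parent[ru] = rv
    return True


P4_EDGES = {frozenset(e) for e in [(0, 1), (1, 2), (2, 3)]}


def embeds_in_p4(n, edges):
    """True if the graph is isomorphic to an induced subgraph of P_4."""
    if n > 4:
        return False
    edge_set = {frozenset(e) for e in edges}
    for image in permutations(range(4), n):
        if all(
            (frozenset((image[u], image[v])) in P4_EDGES)
            == (frozenset((u, v)) in edge_set)
            for u, v in combinations(range(n), 2)
        ):
            return True
    return False


def claim_holds_up_to(max_n):
    """Check: H and its complement both acyclic implies H embeds in P_4."""
    for n in range(1, max_n + 1):
        pairs = list(combinations(range(n), 2))
        for mask in range(1 << len(pairs)):
            edges = [e for i, e in enumerate(pairs) if (mask >> i) & 1]
            co_edges = [e for i, e in enumerate(pairs) if not (mask >> i) & 1]
            if (is_forest(n, edges) and is_forest(n, co_edges)
                    and not embeds_in_p4(n, edges)):
                return False
    return True


verified = claim_holds_up_to(5)
```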
8 Conclusions


We proved in this paper that for every $k \ge 1$, the class of $\mathcal{H}_k$-free graphs has the strong Erdős-Hajnal property. Specifically, there exists $\varepsilon(k) > 0$ such that every $\mathcal{H}_k$-free $n$-vertex graph contains a clique or an independent set of size at least $n^{\varepsilon(k)}$. This result extends, e.g., the result on forbidding long paths and antipaths [5].

The only trees on six vertices that cannot be obtained through the substitution method described in [2] are the path $P_6$ and the 2-hook, also known as the E-graph. Therefore, Conjecture 1.2 is now known to be true for every tree $H$ on at most six vertices.

The question of excluding pairs of graphs in the context of the Erdos-Hajnal conjecture was considered 
also in the directed setting (see: 0)- The directed version of the conjecture is equivalent to the undirected 
one and was recently heavily investigated ([S EH HEM). In the directed setting the analogue of the 
complement of the graph is the graph obtained by reversing directions of all the edges. It would be interesting 
to see whether techniques presented in this paper can be applied in the directed setting to get generalisations 
of some of the known results. 